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nam cttt^t' a c 


mm/~ , /"*/~^m/"< a y~^m 

1 1GCCTGACT 


CAGGGTCACG 


AATGCTCCCA 


TTTGCCAAGA 


AAGCGCCACA 


15240 




LbALL 1 (jL. 1 1 


CCTCATCCGA 


TAAAATCGCC 


TCATCAATAC 


CTGTTTCCAG 


15300 


CZCC& a Ar - * A A a 


IjAG 1L1 GCCA 


AGTGCAAATC 


ACTTAACAAA 


TCCTGCACCT 


TTTCATCTGT 


15360 


a a a a spppha 
AAAAAt— GG 1 A 


TAGACGCGAT 


TCTTGCGAAG 


ATTGCTCCGT 


TGGTGGTGAC 


GAATTTCAGA 


15420 


ill bAl 1 1 CA 


m a a /~» a m f> /~i a 

I AGAGATGGA 


GAAAGGACTC 


ATAGAGGTGA 


CGGGCCAGTT 


TGGCATTTTC 


15480 


1 br 1 LALAAL 1 


G AC AAAGT C A 


AGCCCGAAGT 


CGAGAGACCG 


ATGCTACCAG 


ACATTTTGAT 


15540 


AA 1 GGC AGAT 


AATTCATGCC 


AGCTCAGATG 


GTGTTGGCCC 


AGGATTTCTT 


CTTTTACTGC 


15600 


rp a pm/^m/*' AAA 

1 AC Ibi GAAA 


CTCATTTTTT 


CACCTGTATA 


ATGCGCATCA 


ACTCGTCCAC 


AATCAAATCT 


15660 


L.LA1 CG TGGA 


AGGCACCGCC 


ATTTTCCAGA 


CGAAGGAAGT 


TAGATGAAAT 


CACGCGCGAA 


15720 


AC TTGCTTAC 


AAAG AC C T AC 


AAAATCGTGT 


TCCACTTGCA 


CTAAGTATTC 


ATCAAAACGG 


15780 


I IbbAATTLA 


TGTATTCCTG 


AGGCACTTTT 


TCAATATTCA 


CCAAGACAGT 


GTCGATAAAA 


15840 


GGGCGACCAA 


GGTGACGATG 


CAAGACTTCC 


ACGTGGTCGC 


TATCTGTAAA 


GTGTTCCGTC 


15900 


TCCCCACGTT 


GGGTCATGAT 


ATTGCAGACA 


TAGGCAATTT 


CTGCCTTGGT 


TTCCAAAAGA 


15960 


btttbLLLAA 


TTTCCTTAAT 


CACGATATTG 


GGCAAAATAG 


AGGTAAAGAG 


GGAACCTGGC 


16020 


LLi AGGAC AA 


TCATGTCACT 


TTCAAGGATG 


GTCTGCACTA 


CTCGACGGCT 


GGCCAGAGGC 


16080 


GTATCATCGT 


TTAGGGCATT 


GGTCACATAG 


ACATTGTCAA 


TTATGCCTCG 


ATGGTCTACA 


16140 


AlAlbAL 1 LT 


CTCCAGCCAC 


TTCTGTCCCA 


TCCTGAAAGA 


CTGCATGAAG 


GGTCAAAGGA 


16200 


1 GG 1 CACTGG 


AAGGATAAAT 


TTTCCCTGTT 


GTATGGAAAA 


ATTTGCTCAA 


TAACTGCATG 


16260 


a mm a m a /"»/—• 

GCA1 1ATAGG 


TTGAACCCTG 


CATTTCTGAC 


AAGCCAGCAA 


TGATGAGATT 


TCCCAATGGA 


16320 


1 VjjVjL-L-AvjL^AA 


AGGC TCCGGC 


ATCCTCAGAG 


AACCGATACT 


GAAAGACCTT 


CTCATAAAAC 


16380 


mm a f c c a m a m 
1 1AGGCA1A1 


CCGACATGGC 


CACAAGGACA 


TTACGAAGAT 


CACCTGGCGG 


TGTCAACTGT 


16440 


TG C AT ATT T T 


TTCGGAGTTC 


ACCTGAAGAA 


CCACCATCAT 


CTGCCACCGT 


CACGATAGCT 


16500 


GCGATTTCCA 


CATCTTTTTC 


CCGCAGACTT 


TTTAGAATGA 


CGGGACTTCC 


AGTCCCTCCA 


16560 


CCAATCACCG 


TTATCTTTGG 


TTTTCTCATG 


AACGGTTTAC 


CGTTTCCTTT 


CTGCGGTCTT 


16620 


TGTCGCGATG 


CCCTTCATTA 


ACAGACCAAT 


TCTTGGATAA 


GTCCTGCGCC 


AAGCGTT T AG 


16680 


CAAATGCCAC 


ACTACGGTGT 


TGTCCACCCG 


TACATCCCAT 


GGCAATGGTC 


AAAACGGACT 


16740 


TACCTTCCTT 


TTGGTAACTT 


GGCAGAATCG 


GCTCAATCAA 


GGCCAATAAA 


TGTTGATAAA 


16800 


AGTCTTCTGA 


CTCAGGATGG 


TTCATGACAT 


AATCATAAAC 


AGGTTCATCC 


ACACCCGTTT 


16860 


GGTTTCTCAG 


TTCTGGTAAA 


TAATAGGGAT 


TTGGCAAGAA 


ACGGACATCA 


AAGACCAAGT 


16920 
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CCGCATCAAT 


CGGGATTCCA 


TACTTAAATC 


C G AAAG AC AT 


GACTTCGATA 


CGGAAAGACT 


16980 


GGGCTTGTTC 


TTGGTCTGAA 


AACTGCTCTG 


CAAGGGTTTT 


GCGCAgCTCA 


CGTGGAGTGA 


17040 


GTTCAGTCGT 


ATCCACCACA 


TTTTGGCTCA 


TATTTTTCAA 


AGGTGCCAAG 


AGTTCACGTT 


17100 


CCAACTTGAT 


TCCATCTAAA 


ATACGACCGT 


CTGCTGCTAG 


TGGGTGACTC 


CGTCTGGTTT 


17160 


CCTTGTAACG 


AGCGACCAAT 


TCCTTATCAG 


CCGCATCCAA 


AAAGAGGATT 


TTGAAAT C C A 


17220 


AAC CAT C TTG 


ATTTTCCAAC 


TCATCCAAAA 


CAGCTTGAAT 


CTCTGAAAAG 


AAAGAACGGC 


17280 


TACGCATATC 


CACTACCAAG 


GCCAACTTAG 


GATTGTCTTC 


CTTAATTTCA 


ACCAGCTGCA 


17340 


AAAACTTAGG 


CAAGAGAGCT 


GGCGGCATAT 


TATCAATGGT 


GAAATAACCT 


AG AT C C TCGA 


17400 


AGGACTGAAT 


GGCTACAGTT 


TTCCCTGCGC 


CACTCATCCC 


TGTCACAATC 


ACCAAGTGAA 


17460 


GTTGTTTCTT 


TGTCATCTTT 


TTCTCCTTAT 


ATCAAAAGAA 


GTTTGGCAAC 


ACCAAACTTC 


17520 


AACTAGCTTA 


TCCAATCTCT 


GCGATGACTT 


CAATTTCGAC 


TTTTACATCA • 


CGAGGAAGAC 


17580 


GAGCTACCTC 


CACAGCTGAA 


CGAGCTGGGA 


ATTCCTCTTT 


GAAGGCCGTT 


TGGTAAACCT 


17640 


CATTAAAAGG 


AACAAAGTCG 


TTCATATCGC 


TCAAGAAGCA 


AGTTGTTTTG 


ACAACATGGT 


17700 


CAAAGTCTGT 


TCCTGCTTCT 


GCCAAAATAG 


CACCGATGTT 


TTTCAAGACT 


TGCTCTGTCT 


17760 


GTTCTTGGAT 


ATTCTCTCCT 


ACAATTTCCC 


CAGTTTCAGG 


GGATAGGGGA 


ACTTGACCGC 


17820 


TAGCAAACAA 


AAGGTTGCCA 


ACGATTTTTC 


CTTGAACATA 


GGGTCCGATA 


GCCTTTGGGG 


17880 


CCTTATCTGT 


ATGAATTGTT 


TTTGCCATTT 


TCTTTTCCTC 


ACAATTTTTC 


TAAGATTGCA 


17940 


TCCCAAGCCT 


CATCCATCCC 


TGCCTTACTG 


ACAGATGAAA 


AGAGGATGAA 


ATCGTCACTC 


18000 


GGGTCAAAGT 


TTAATTTCTT 


TTTGATTGCT 


GATTCATGCT 


TGTTCCATTT 


ACCACGAGGA 


18060 


ATCTTGTCCG 


CCTTGGTCGC 


CACAATGATG 


ACTGGAATCT 


CATAATACTT 


GAGAAATTCG 


18120 


TACATCTGCA 


CATCATCTGC 


TGACGGGTCA 


TGACGAAGGT 


CAACTAGACT 


GACAACCGCA 


18180 


CGGAGATTTT 


CCCGAGTCGT 


TAAGTACTCC 


TCAATCATGC 


ACCCCCACTT 


TTCACGTTCC 


18240 


TTTTTAGAAA 


CACGAGCATA 


GCCATAACCA 


GGCACATCCA 


CAAAGCGCAT 


CTTGTCATCA 


18300 


ATGTTAAAAA 


AGTTCAGGAG 


CTGGGTTTTA 


CCAGGTTTTC 


CTGATGTACG 


GGCG AG ATT C 


18360 


TTACGGTTCA 


ACATAGTGTT 


GATAAAGCTG 


GATTTACCAA 


CATTTGAACG 


CCCTGCTAGG 


18420 


GCAATCTCTG 


GCAGTTCATC 


CTGCGGATAG 


TGGGACTTAT 


TAGCTGCACT 


GAGCAAGATT 


18480 


TCAGCATTGT 


GTGTATTAAG 


TTCCATAGTC 


ACCTCTAGGC 


TGTTTCTAGG 


ATCGGTTTAT 


18540 


CCGTTCCATC 


TACAGTTTCT 


TTAGTGATGC 


GAACCAATTT 


CACATTTTCC 


TGACTCGGCA 


18600 


CCTCAAACAT 


GACATCTAGC 


ATGGTTTCTT 


CGATGATGGA 


GCGAAGTCCA 


CGCGCCCCTG 


18660 


TCTTCCGTTC 


GATTGCTTTA 


TTAGCAATCT 


CTTGAAGGGC 


TTCGTCGTCA 


AATTCCAACT 


18720 
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CAAC ATCAT C 


ATAAGAAAGC 


AAGGTTTGGT 


ATTGTTTCAC 


CAAGGCATTT 


CTTGGCTCTT 


18780 


1 LAACjATGCG 


AACCAAGTCA 


TCAACGGTCA 


ATTGCTCAAG 


AGCCGCAAAA 


ACAGGCAAGC 


18840 


GTCCAATCAA 


CTCAGGGATA 


ATACCAAATT 


TTTGAATGTC 


TTCAGCGATG 


ATTTCTTGCA 


18900 


TGTATGAGCT 


GTTTTCGTCA 


ATCGCCTTAT 


TATTTTGACC 


AAATCCGATG 


ACTTTTTCAC 


18960 


CCAGACGTTG 


TTTGACAATT 


TCTTCAATAC 


CAT C AAAAGC 


ACCACCCACG 


ATGAAGAGGA 


19020 


TATTTTTTGT 


ATCCACTTGA 


ATCATCTCTT 


GTTGTGGATG 


TTTGCGTCCA 


CCTTGAGGCG 


19080 


GTACGCTAGC 


AACAGTTCCC 


TCAATAATCT 


TGAGAAGGGC 


TTGTTGCACC 


CCTTCACCAG 


19140 


AAACATCACG 


TGTGATAGAC 


ACATTCTCAC 


TCTTCTTGGC 


AATCTTGTCA 


ATTTCATCCA 


19200 


CATAGATAAT 


GCCACGCTCT 


GCACGTTCGA 


TGTTAAAGTC 


AGCAACCTGC 


AAGAGTTTGA 


19260 


GGAGGATATT 


TTCCACATCC 


TCACCCACAT 


AACCAGCCTC 


CGTCAGAGCT 


GTCGCATCCG 


19320 


C AAT AG C AAA 


AGGTACATTC 


AAGC T C T TAG 


CCAAGGTCTG 


GGCAAGGAAA 


GTTTTCCCTG 


19380 


AACCAGTTGG 


GCCAATCATC 


AAAATGTTTG 


ACTTCTGCAA 


ATCCACATCT 


TCTGACTCTT 


19440 


CGCGTGTATC 


GTGGAAATTG 


ATGCGTTTGT 


AGTGGTTATA 


AACCGCCACT 


GCCAAGGCAC 


19500 


GCTTGGCACG 


ATCTTGACCA 


ATTACATAGT 


GGTTCAAGAT 


ATGGAGGAGT 


TCAATTGGTT 


19560 


TTGGCACCTC 


AGACAAGTCT 


GCCAAGACTT 


CCTCAACCAA 


TTCTTCTCGA 


ATGATTTCCT 


19620 


GAGCTAACTC 


CACGCATTCA 


TTACAAATAA 


AAGCATTGTT 


GCCAGCAATT 


ATTTTTTGTA 


19680 


CTTCTTCTTG 


GTTTTTGCCA 


C AAAAT GAG C 


AAT AAAC CAT 


CAT AT C ATTT 


TTTCTATTTG 


19740 


TAGACATGAT 


TTCCTTCCAT 


TCTATACTGT 


CATTCTATCT 


AAAAT AAGGT 


CATGTAAAAA 


19800 


GCATGAATAC 


TATTGACCAG 


ATTGGTAAAG 


GCATTTAACC 


AAAGG AG GAT 


AGAAAGCCCG 


19860 


TAACGCTTTT 


TACGAAAAGC 


TTGTGCTCCT 


GCCAGAAAGC 


AGATGAAACA 


CAGAAAAGCC 


19920 


GTGAATAGAC 


CAAATAAACT 


CCGTTCCATT 


AGACTTCCTT 


TCTCTTGCGG 


TATTGGATGG 


19980 


T AAAAT CAT A 


AGGATTCTTC 


TCATCTTTGG 


CGTAAAATTT 


GCTTGAAACT 


GTCTCAAAAA 


20040 


GAGACAAGTC 


AAGTTCTTCA 


GGGAAATAGG 


TATCTCCTTC 


CACCCGAGCA 


TGAATGTGAG 


20100 


TGACAATCAC 


TTCATCAAGG 


TAAGGTTCAA 


AAGCCTGAAA 


AATTTGCTTC 


CCACCGATAA 


20160 


TGTAGAGATT 


CTTTTCTTGA 


GCCTGATACC 


AGTCAAGAAC 


AG ACT GG ACG 


TCCTGAAAAG 


20220 


TAGCAACCCC 


ATCTATCTTT 


TCTTCCGGAT 


TACGCGTCAA 


AATCAAGGTT 


TCCCGTTTTG 


20280 


GAAGCAAGCG 


ACGCCCCATC 


CCATCAAAGG 


TCACACGCCC 


CATCAAGATA 


GCATGATTCA 


20340 


GAGTTGTTTC 


TTTAAAGTGC 


TGCAATTCTG 


CTGGCAAATG 


CCAAGGCAGA 


CGATTTTCCT 


20400 


TACCAATCAC 


ACCCTCTTCA 


TCCTGGGCCC 


AAATAGCTAC 


GATTTTCTTA 


GTCATGCTTC 


20460 
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CAT C C T TTTC ACTGATAGTA CTATTTTATC AAAAAACTCA AAAAAAGACT GGTTTGGAAT 2 0520 

AGCTTACAAA ATAGAAAAAA TCTGTAAGAA ATTTCCTACA GATTTATCTA TGTTTCCTTA 2 0580 

TTTCTTACAA ACCAGGTGCT TGTCCAAGTT CGGCTGCAAG CATCCAAATT GTTTTATCTG 2 0640 

TTTCAGTTTT AGCGCCTGCA AAGATACCGT TTGTCACATC GTCACCTTCT TCATCAGTGA 2 0700 

CATCCAAACC TTTTTGGAAA AGTTCTGACA AGTAACGGTA GATAACAAGA ACACGTTCCA 2 0760 

AGCTTTCTTC AACATTACGG TATTCACCAG CTTCTTCTTC GATTTCACTA TTTTGAAGGA 2 0820 

ACTCTGTCAA TGTAGAGAAT GGGCTTCCAC CGAGTGTAAT CAAGCGTTCA CTGATTTCAT 2 0880 

CCAATTGACC GTCAAGAGCT TCCATGTACT CATCCATTTT TGG AT GCC AT ACAAGGAAAC 2 0940 

CACGACCATG C AT AT AC C AG TGCACTTGGT GCAAAGCAAC GTGAGCTACA TACAAATCAG 21000 

CAACAGCTTG GTTCAAGACT TCCTTTGTTT TTGCCAATGC 21040 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 87 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 56: 
ATTCTTAATA CGATTAAAAG GCTTATTACT AAAAGAAAAT TTCAGTTAGA TGAACTAAAC 60 

TTGCTCGTCA AATCCCGATT TAACGAGATG TTTGGGGAAA ATAAAATATT TGAAAGCATT 120 

GATAACTTAT T TG AT ATT AT AGATGGTGAT AGGGGCAAAA ATTATCCTAA ATCAGATGAG 180 

TTGTTTAGTG AGGAGTACTG TTTATTTTTA AATACAAAGA ATGTTACTAA AAACGGATTT 24 0 

TCATTCGATA CAAAGCAATT TAT C ACT AAA ACAAAGGATA AATT ACT T C G AAAAGGCAAA 3 00 

CTTGAGCGTT ATGATATAGT CTTGACAACA AGAGGTACTG TTGGAAATGT AGCGTACTAC 3 60 

GAT G AATT AA TAAAATATAA ACATTTACGT AT AAATT C AG GTATGGTAAT ATT AC GT C C C 420 

AAGACACCAA ATCTAAATCA G AAATT TATT ATCCATGTTT TAAGGAATAA TAATTATAGT 4 80 

CGAGTGATAT CAGGAAGTGC TCAGCCTCAG TTACCAATTA CAAAATTAAA AAAAATACTT 54 0 

CTCCCCCTCC CCCCACTAGC CCTCCAAAAT GAGTTCGCAG ACTTTGTAGT CCAGGTCGAC 600 

AAATCACAAT TGGCAATCCA AAAATCTCTG GAAGAACTTG AAACTTTGAA GAAATCTCTG 660 

ATGCAGGAGT ATTTTGGCTG ATATTCTGCC ATTGTAATTA CGGTAATGAT TTGTTATAAT 72 0 

ACTTCAAAGG AGGAAATCAG ATGGTAGTAA AAACAAGAAA ACAAGGAAAT TCAATCACCA 7 80 

TTACGATTCC AAGTGAATTT AATATTCCAA GTGGTGTTAA ATACGAAGCG AAATTGTTAC 840 
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CAAGTGGTGA GATTATCTTT ACTCCTGAAG AATTGGGGCA GCAGGTTTCT TATGTATCTG 900 

ATGATGCCTT TGACTTAAAT T T AG AT AAAA TATTTGACGA ATACGACGAT GTTTTCAAAG 9 60 

CTTTGGTGGA AAAATGACAA TCTATTTGAC AGAAAAGCAA ATTGAAAAAA TAAATGCTTT 102 0 

AGCAATTCAA CGGTATTCTC CAAATGAGAA AATTCAAACA GTTAGTCCTT CTGCCTTAAA 10 8 0 

TATGATTGTG AACTTACCAG AACAATTTGT CTTTGGGAAG CCTCTTTATC CAACAATTTT 1140 

TGATAAAGCA ACGATACTAT TTGTCCAATT GATAAAGAAG CATGTTTTTG CTAATGCTAA 12 00 

TAAAAGAACT GCTTTCTTCG TTTTGGTCAA ATTTTTACAA TTAAACGGCT ATCGTTTTTC 12 60 

TGTAACGGTA GAAGAAGCAG TAAAAATGTG TGTAACCATC GCAGTAGAAG CTTTAACTGA 13 2 0 

TGAAAAAATG ACAAGCTACT CCAAATGGAT TTCTGAACAT TCTGTTAGAG AAAAGGTCAA 13 80 

AAAGTAACCT AGTATGCTGG ATTTGAATGA GCACAAGAAA ATAAATGAAC AGACAATATT 144 0 

AGAATTCTGT AATGCAGAAA CTGATATTGT CTCTTTTTAT TGATGAATAA GAAAGTGAGA 1500 

AATTATGGAA TCAAAAGTTA CAATTATCAT GCAAGAAATG TTACCTCTTT TAAATAATGA 15 6 0 

ACAATTACTA GCGTTGAGAG AG AGTT T AG A ACATCATCTA GTAGACGGAA AAAAGCAGCA 162 0 

GAAGTATTCG AATAATAACC TGTTGCAACT ATTTATTACC GCCAAGCAGG TAGAGGGCTG 16 80 

TAGCTCAAAA ACAATTCGTT ATTATCAGAG GACGATTGAA AACTTGTTTA ATGCTATTAA 174 0 

AGAGTCTGTG AC AC AAC T C A CAACAGATGA TTTAAGGAGT TATTTAGCAA ATTACCAGTC 18 00 

TGAAAAGGAT TGTAGTAAGG CAAATTTAGA CAATATTAGG CGTATATTGT CTTCTTTTTT 1860 

TGCTTGGCTT GAGCAAGAGG ATATATCATT AAAATTCCCA TTCGACGGAT ACAGAAAATT 19 2 0 

AAGACTGAGC AAAATGTGAA GGAAACTTAT ACTGATGAAC ATTTGGAAAT TATGCGTGAT 19 8 0 

AACTGTGAAA ATTTGAGAGA TTTGGCAATA ATAGACCTAC TAGCATCGAC AGGTATGCGT 2 040 

GTAGGGGAGC TTGTACAGTT GAATCGTTCA GATATTGATT TTGAAAACAG AGAGTGTGTT 2100 

GTCTTTGGTA AAGGAAAGAA GGAGAGACCA GTATATTTTG ACGCTCGTAC GAAAATTCAT 2160 

TTAAGAAATT AT CTT AACG A CAGAAAAGAT AGTCACCCTG CTCTTTTTGT AACGCTAGTT 2220 

GGAAAAGTCC AGAGGCTTGG AATTGCTGGT GTAGAGATTC GCTTAAGAAA GTTAGGAGAC 22 80 

AAACTCGGCA TACAAAAGGT TCACCCACAT AAGTTCAGAA GAACTTTAGC GACTAAGGCA 2340 

ATTGATAAAG GTATGCCTAT CGAACAAGTC CAAAAACTGC TAGGTCA 2 3 87 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10669 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 57: 



ATATTAAAGC 


GACTTTCTGT 


GCGCTAGGGA 


AAAATGTTCC 


TGGG AAT GAG 


GACTTGGTGA 


60 


AGAGGATAAA 


ATCTGAAGGT 


CATGTTGTTG 


G AAAC C AT AG 


CTGGAGCCAT 


CCGATTCTCT 


120 


CGCAACTCTC 


TCTTGATGAA 


GCTAAAAAGC 


AGATTACTGA 


TACTGAGGAT 


GTGCTAACTA 


180 


AAGTGCTGGG 


TTCTAGTTCT 


AAACTCATGC 


GTCCACCTTA 


TGGTGCTATT 


ACAGATGATA 


240 


TTCGCAATAG 


CTTGGATTTG 


AG CTTT ATC A 


TGTGGGATGT 


GGATAGTCTG 


GACTGGAAGA 


300 


GTAAAAATGA 


AG C AT CT ATT 


T T G AC AG AAA 


TTCAGTATCA 


AGTAGCTAAT 


GGCTCTATCG 


360 


TTTTGATGCA 


T GAT AT T C AC 


AGTCCGACAG 


TCAATGCCTT 


GCCAAGGGTC 


ATTGAGTATT 


420 


TGAAAAATCA 


AGGTTATACC 


TTTGTGACCA 


TACCAGAGAT 


GCTCAATACT 


CGCCTAAAAG 


480 


C T CAT G AG CT 


GTACTATAGT 


CGTGATGAAT 


AAGCAAGAAA 


AAATAGGTCT 


GTTAGATATT 


540 


TG AC AG AC TT 


ATTTTTTACA 


GAATATAGTA 


CTACTTAAAA 


AATGTTTTAT 


GCTATAATTG 


600 


ATGAATAAAA 


TAGAAGGAGA 


AGCATATGAA 


T AC C T AT C AA 


TTAAATAATG 


GAGTAGAAAT 


660 


TCCAGTATTG 


GGATTTGGAA 


CTTTTAAGGC 


TAAGGATGGA 


GAAGAAGCCT 


ATCGTGCAGT 


720 


GTTAGAAGCC 


TTGAAGGCTG 


GTTATCGTCA 


TATTGATACG 


GCGGCGATTT 


ATCAGAATGA 


780 


AGAAAGTGTT 


GGTCAAGCAA 


T C AAAG AT AG 


CGGAGTTCCA 


CGTGAAGAAA 


TGTTCGTAAC 


840 


TACCAAGCTT 


TGGAATAGTC 


AGCAAACCTA 


TGAGCAAACT 


CGTCAAGCTT 


TGGAAAAATC 


900 


TATAGAAAAA 


CTGGGCTTGG 


ATTATTTGGA 


TTTGTATTTG 


ATTCATTGGC 


CGAACCCAAA 


960 


ACCGCTCAGA 


GAAAATGACG 


CATGGAAAAC 


TCGCAATGCG 


GAAGTTTGGA 


GAGCGATGGA 


1020 


AGACCTCTAT 


CAAGAAGGGA 


AAATCCGTGC 


TATCGGCGTT 


AGCAATTTTC 


TTCCCCATCA 


1080 


TTTGGATGCC 


TTGCTTGAAA 


CTGCAACTAT 


CGTTCCTGCG 


GTCAATCAAG 


TTCGCTTGGC 


1140 


GCCAGGTGTG 


TATCAAGATC 


AAGTCGTAGC 


TTACTGTCGT 


GAAAAGGGAA 


TTTTATTGGA 


1200 


AGCTTGGGGG 


CCTTTTGGAC 


AAGGAGAACT 


GTTTGATAGC 


AAGCAAGTCC 


AAGAAATAGC 


1260 


AG C AAATC AC 


GGAAAATCGG 


TTGCTCAGAT 


AGCCTTGGCC 


TGGAGCTTGG 


CAGAAGGATT 


1320 


TTTACCACTT 


CCAAAATCTG 


TCACAACCTC 


TCGTATTCAA 


GCTAATCTTG 


ATTGCTTTGG 


1380 


AATTGAACTG 


AGTCATGAGG 


AGAGAGAAAC 


CTTAAAAACG 


ATTGCTGTTC 


AATCGGGTGC 


1440 


TCCACGAGTT 


GATGATGTGG 


ATTTCTAGAA 


AATCATAAAA 


AGAATTGTAC 


ATTATTCTAA 


1500 


TTTTTGATAT 


AATAGTCAGC 


AGGAAAGAAA 


GTCTTATGGC 


GTTCTTCAAG 


CGAGCTTGGG 


1560 


ATAGTGGGAG 


CCAAGTAGGG 


CAAAATAAAG 


GGCTGGCGCT 


TTCTGTAGTA 


TTTTCAAAAA 


1620 
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tAA 1 GAAGTA 


ATAAATTAGG 


GTGGAACCGC 


GTTTCTGACG 


CCCCTAGGTT 


AAATCAACCT 


1680 


AbbAi 1 G I L.A 


GATGTGGTTC 


TTTTGCTTAT 


TCAGTCTATT 


GTGTGAAAGA 


AAGGAGAGCC 


1740 


\j 1 bbALAALL 


TTTATCTTGT 


AAAAG AC GAT 


AGTCAACTAG 


CTACATTTCG 


TGATTTTGTA 


1800 


GTAAGAAATA 


CTGAAAAGTT 


GAAAGATTAT 


CAATCTTTTT 


TAAAGAATGA 


ACTTGCAGTC 


1860 


Tif^m/^ a mm m 7\ /~* 

lbl bATTTAC 


CGCAAGCTGT 


TATTTGGTCA 


GATTTTAATG 


CTGCTACACA 


GATTATTAGG 


1920 


GAAAGTGCTG 


TTCCAACCTA 


TACAAATAAT 


AGACGAGTGG 


TTATGACGCC 


TGATTTAGCT 


1980 


GTTTGGAAAG 


AATTGTATTT 


GTATCAGTTG 


ATGGACTACG 


AGTGTTCTGA 


GCAAACTCAA 


2040 


GCAATAGAAA 


GTCACTATCA 


TTCTTTATCT 


GAAAATTTCC 


TCTTACAGAT 


TGTAGGACAT 


2100 


GAGTTAGCTC 


ATTGGTCGGA 


CATTTTTTAG 


ATGATTTTGA 


TGGTTATGAC 


TCTTATATCT 


2160 


GGTTCGAAGA 


GGGGATGGTT 


GAATATATTA 


GTCGCAAGTA 


TTTCTTGACA 


GAAGAGGAAT 


2220 


TTCAAGCGGA 


AAAAATTTGT 


AATCAATCTC 


TCGTAGAACT 


TTTTCAGAAG 


AAGTATAGTT 


2280 


GGCATTCATT 


GAATGATTTT 


GGTTCTTCGA 


CTTATGATAA 


GAACTATGCA 


AGTATTTTTT 


2340 


ATGAATACTG 


GCGCAGCTTT 


TTGACAGTAG 


AT AAGT TGGT 


AGAAAATTTA 


GGTAGTGTAC 


2400 


AAGCGGTCTT 


AGATTCTTAT 


CATTTATGGG 


CAAATACAGA 


AAAAACTTTT 


CCCTTGTTAG 


2460 


ATTGGTTTGT 


T C AGC AG AAA 


TTAATTGAAA 


AAGAAATATA 


AAAACTAAAG 


GAGTAAACAA 


2520 


TGTCTAAGAA 


ATTAACATTT 


CACTGCATCA 


GTGGCAGAGA 


CCTCCTTACA 


GTCGGGCTGC 


2580 


TCCACGCTCA 


GCACTAGAGT 


GCCTGAGCTA 


GACGCAGTAC 


TAACTCGTCT 


TGCCTCGTAT 


2640 


GATCGACGAG 


GCAGACTCGT 


GTCGCAAGTA 


ATTATTTTTT 


ATTAAGGAGT 


ATTCAATGTC 


2700 


TAAGAAATTA 


ACATTTCACT 


GCGTCAGTGG 


CAGAAACCTC 


CTTACAGTCG 


GACTGCCCTA 


2760 


CGCTCAGCAC 


TAGAGTGCCT 


GAGCTAGACG 


CAGTACTAAC 


TCGTCTTGCC 


TCGTATAATC 


2820 


bALGAbbCAG 


ACTCGTGTCG 


CAAGAAATTA 


TTTTTTATTA 


AGGAGTATTC 


AATGTCTAAG 


2880 


AAATTAACAT 


TTCAAGAAAT 


TATTTTGACT 


TTGCAACAAT 


TTTGGAATGA 


CCAAGATTGT 


2940 


ATGCTTATGC 


AGGCTTATGA 


TAATGAAAAA 


GGTGCGGGGA 


CAATGAGTCC 


TTACACTTTC 


3000 


CTTCGTGCTA 


TCGGACCTGA 


GCCATGGAAT 


GCAGCTTATG 


TAGAGCCATC 


ACGTCGTCCT 


3060 


GCTGACGGTC 


GTTATGGGGA 


AAACCCTAAC 


CGTCTCTACC 


AACACCACCA 


ATTCCAGGTG 


3120 


GTCATGAAGC 


CTTCTCCATC 


AAATATCCAA 


GAACTTTACC 


TTGAGTCTTT 


GGAAAAATTG 


3180 


GGAATCAATC 


CTTTGGAGCA 


CG AT AT T C GT 


TTTGTTGAGG 


ACAACTGGGA 


AAAC C CAT C A 


3240 


ACTGGTTCAG 


CTGGTCTTGG 


TTGGGAAGTT 


TGGCTTGACG 


GAATGGAAAT 


CACTCAGTTC 


3300 


ACTTATTTCC 


AACAAGTCGG 


TGGATTGGCA 


ACTGGCCCTG 


TGACTGCGGA 


AGTTACCTAT 


3360 
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GGTTTGGAGC 


GCTTGGCTTC 


T T AC AT T C AA 


G AAGT AG AC T 


CTGTCTATGA 


TATCGAGTGG 


3420 


GCTGATGGTG 


TAAAATACGG 


AGAAATCTTT 


ATCCAGCCTG 


AGTATGAGCA 


CTCAAAATAT 


3480 


T C AT TT G AAA 


TTTCGGACCA 


AGAAATGTTG 


CTTGAAAACT 


TTGATAAGTT 


TGAAAAAGAA 


3540 


GCTGGTCGTG 


CATTAGAAGA 


AGGCTTGGTA 


CACCCTGCCT 


ATGACTATGT 


TCTCAAATGT 


3600 


TCACATACCT 


TTAATCTGCT 


TGACGCGCGT 


GGTGCCGTAT 


CTGTAACAGA 


GCGTGCAGGC 


3660 


TATATCGCTC 


GTATCCGTAA 


CTTGGCCCGT 


GTCGTAGCCA 


AAACCTTTGT 


CGCAGAACGC 


3720 


AAACG C C TAG 


GCTACCCACT 


TTTGGATGAA 


GAAACAAGAG 


CTAAACTCCT 


AGCAGAAGAC 


3780 


GCAGAATAAA 


GAGAGTGACA 


AATTACGAAA 


ATGGGCGAAC 


AGAGTGAGCC 


CTGAGCCAGT 


3840 


TGCCGCAGTG 


ATGAAGGTAT 


CCTTAGTGAA 


ACTAAGGATA 


CTAGGCAAAA 


TTGGAGACTT 


3900 


TTGGCTCCAA 


TTTTAGCAAT 


GAAACAACGA 


AGTTGGTTGC 


TTGCGTGCCA 


AT C AC AT AAG 


3960 


GCAAACTGGA 


AAATAAAAAG 


AT ACT TTT C G 


GAGAAAAAAC 


ATGACAAAAA 


ACTTATTAGT 


4020 


AGAACTCGGT 


CTTGAAGAAT 


TACCAGCCTA 


TGTTGTTACG 


CCAAGTGAAA 


AACAACTAGG 


4080 


CGAAAAAATG 


GCAGCCTTCC 


TCAAGGGAAA 


ACGCCTGTCT 


TTTGAAGCCA 


TTCAAACTTT 


4140 


CTCAACACCA 


CGTCGTTTGG 


CTGTTCGTGT 


AACTGGTCTT 


GCAGACAAAC 


AGTCTGATTT 


4200 


AACAGAAGAT 


TTCAAGGGTC 


CAGCAAAGAA 


AATTGCCTTA 


GATAGTGATG 


GAAACTTCAC 


4260 


CAAAGCAGCT 


CAAGGATTTG 


TCCGTGGGAA 


AGGTTTGACT 


GTTGAAGATA 


TCGAATTCCG 


4320 


TGAAATCAAG 


GGTGAAGAAT 


ATGTCTATGT 


CACTAAGGAA 


GAAATTGGTC 


AAGCAGTTGA 


4380 


AGCCATTGTT 


CCAGGCATTG 


TGGATGTCTT 


GAAGTCACTG 


ACTTTCCCTG 


TCAGCATGCA 


4440 


CTGGGCGGGA 


AATAGCTTTG 


AATACATCCG 


CCCTGTTCAC 


ACTTTAACTG 


TTCTCTTGGA 


4500 


TGAGCAAGAG 


TTTGACTTGG 


ATTTCCTTGA 


TATCAAGGGA 


AGTCGTGTGA 


GTCGTGGCCA 


4560 


TCGTTTTTTG 


GGACAAGAAA 


CCAAGATTCA 


GTCAGCATTG 


AGCTATGAAG 


AAGACCTTCG 


4620 


TAAGCAGTTT 


GTAATCGCAG 


ATCCATGTGA 


ACGTGAGCAA 


ATGATTGTTG 


AC C AAATC AA 


4680 


GGAAATTGAG 


GCAAAACATG 


GTGTACGTAT 


CGAAATTGAT 


GCGGATTTGC 


TGAATGAAGT 


4740 


CTTGAATTTG 


GTTGAATACC 


CAACTGCCTT 


CATGGGAAGT 


TTTGATGCTA 


AATACCTTGA 


4800 


AGTTCCAGAA 


GAAGTCTTGG 


TGACTTCTAT 


GAAGGAACAC 


CAGCGTTACT 


TTGTTGTTCG 


4860 


TGATCAAGAT 


GGAAAACTCT 


TGCCAAACTT 


CATTTCTGTT 


CGTAACGGAA 


ACGCAGAGCG 


4920 


TTTGAAAAAT 


GTCATCAAAG 


GAAATGAAAA 


AGTCTTGGTA 


GCCCGCTTGG 


AAGACGGAGA 


4980 


ATTCTTCTGG 


CGTGAAGACC 


AAAAATTGGT 


GATTTCAGAT 


CTTGTTGAAA 


AATTAAACAA 


5040 


TGTCACCTTC 


CATGAGAAGA 


TTGGTTCTCT 


TCGTGAACAC 


ATGATTCGTA 


CGGGTCAAAT 


5100 


C AC TGT ACT T 


TTGGCAGAAA 


AAGCTAGTTT 


GTCAGTGGAT 


GAAACAGTTG 


ACCTTGCTCG 


5160 
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TGCAGCAGCC ATTTACAAGT TTGACTTGTT GACAGGTATG GTTGGTGAAT TTGACGAACT 522 0 

CCAAGGAATT ATGGGTGAAA AATACACCCT TCTTGCTGGT GAAACTCCAG CGGTGGCAGC 52 8 0 

TGCTATTCGT GAACACTACA TGCCTACATC AGCTGAAGGA GAACTTCCAG AGAGCAAGGT 53 4 0 

CGGCGCAGTT CTAGCCATTG CAGACAAATT GG AT AC GAT T TTGAGTTTCT TCTCAGTAGG 5400 

ATTGATTCCA TCAGGTTCTA ATGACCCTTA TGCCCTTCGT CGTGCAACTC AAGGTGTGGT 54 60 

TCGTATCTTG GATGCCTTTG GTTGGCACAT TGCTATGGAT GAGCTGATTG ATAGCCTTTA 5 52 0 

TGCATTGAAA TTTGACAGTT TGACTTATGA AAATAAAGCA GAGGTTATGG ACTTTATCAA 5580 

GGCTCGTGTT GATAAGATGA TGGGCTCTAC TCCAAAAGAT ATCAAGGAAG CAGTTCTTGC 5640 

AGGTTCAAAC TTTGTTGTGG CAGATATGTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG 5700 

CAAGGAAGAA GATTTTAAAC CATCTGTTGA ATCACTTTCT CGTGCCTTTA ACCTGGCCGA 57 60 

GAAGGCAGAA GGGGTTGCTA CGGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC 582 0 

TTTGGCAGAA GCAGTAGAAA CACTCATTTT AT C AGG AC C T GCAAGTCAGC AATTGAAACA 58 80 

ACTTTTTGCG CTTAGCCCAG TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA 59 4 0 

AGATCAGGCT GTCCGTCAAA ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC 6000 

TAAGTTTGCT TGTTTTAACC AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT 6 0 60 

TAT T AC AAAG GAGAAGAAAT GGATCCGAAA AAAATTGCTC GTATCAATGA GCTTGCTAAA 612 0 

AAGAAAAAAA CAGAAGGCTT AACACCAGAA GAAAAAGTGG AACAAGCCAA ACTACGTGAG 618 0 

GAGTACATCG AAGGTTATCG CCGCGCTGTT CGTCACCACA TTGAAGGAAT CAAAATTGTG 62 4 0 

GACGAAGAAG GAAACGATGT TACACCAGAA AAACTACGCC AAGTACAACG TGAAAAAGGA 63 00 

TTACATGGCC GTAGTCTTGA TGATCCAAAT TCATAATAAT ACTCTTCGAA AATCAAATTC 63 60 

AAACCACGTC AGCTTCACCT TGCCGTACTT AAGT AC AG C C TGCGGCTAGC TTCCTAGTTT 642 0 

GCTCTTTGAT TTTCATTGAG TATATGTATT CTTTCTTTTA ACAAAGATAG ATGAAACGAT 6480 

AACAAAGAGA CTAGCAGTTT GTGTTTGCTA GTCTTTTTTC GCTAAAAAAG GAACCATAAT 654 0 

GGTTCCTAAA AACTATCATT AGTAACTTGC ACCGGCTGTA GCGTCTGCGT CACCACCGTG 6600 

GCCTCCAGCA TCCCCTGAAT CAGAAGCGCC AGAAGTAGCA TCGGCGTCTC CATGACCTCC 6660 

GGCAGCAGGA GCAAATGGTC CGCTACCACC CACCAAACGT TGACCAGTCT CTTTTAGGTA 672 0 

CCAGTCAAGC CATGGTTGGA AGTTAAAGAC GATTTCATTG ATACCAGCGT AT G AT CC AT C 67 8 0 

AG G AT AGT AC ATTGCTTGGT AGTTGTGAGT GTTGATAACA CCTGCAGGAG AACCTGGAAC 6 84 0 

GATCGTACGG ACGTATTCTT GGTTTCCGTT GCGAAGTGTT CCGATAACCC ACTCTACGTT 6 900 
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CTTCATACGT GCTGGTGGAA GAGAACCATG AACAGTCGAC ATACGGCTAC CTGATTGAGG 6 9 60 

TGGTACACGT TTAGCGAACA TAGTGTCTGG ATCTTGGTGA GCGTTGTTGT AGTAGAGGAA 702 0 

TTGGTTGTTG TCGTCAGCGT ATGTCAATTC AAATGGCATA GCTTTCAAGA AC AT AT C AAT 70 8 0 

TTGGTTAACT GTTAGGATAC CGTGGTCCAA T T TG AC AT AG GTATCACCAG AAACAGCACC 7140 

AGTGAATGCT GCAACTTTTT CTACCCATTC TGGATCGTCA GGGTCAACTT CTGTGATGGT 72 00 

TGT AG CG ATT GGTTTTCCAC AATCCAAGTC TTCTGATTCG ATTGGTTTTG GTTTTTTCAA 72 60 

TTTCGAAACG ACTCCTACGT ATTTAACAAA GTTATCTAAG CAAGTTTCAA GGAATTTAAC 73 2 0 

AGTGCCTTCG TTGGTGATAT TTCCGTTGTT ATCAAAAGCT TCCTTAGCTT TACCAAGAAG 7 3 80 

GAATTCGTTA CCTGGAAGCG TGTAGGCATT AACACCTGGA GCATCAAGGA TTTTACGAAG 744 0 

GTGAACTTGA GCACGTGATG TTCCTTGGTC ATAGTATGAT GCACCCACAA TCATAACAGG 7 500 

CTTGTTTTCA AATGGATGAA CTTCGTATGA AAGCCATTCA AGT AC AG AT T TGAGTGAAGC 75 60 

TGAGATAGTG TGGTTATGCT CAGGAGTAGC AATGATAACA CCATCTGCAC GAGTAATTTT 7 62 0 

GT TAT AT AAA TAACGTAATT GGAAACTTTC ATCCCATTTT TCATCTTGGT TAAACATTGG 7 680 

AACTTCGTCA ATTTCAAGAA CTTCTAATTC AAATTTGAGT TTGAAGTAGC GACGGATAAA 774 0 

TTCCAAGAGC TTACGGTTAT ATGATTGATC GTAGTTTGAT CCAACAAGTC CAACAAATTT 7 800 

CATTCTTTTT GGTCTCCTAT CTTACAAATT TTCCCAGTCA AAGTCTTCAG CATCTTTGCG 78 6 0 

AAGTAATTCT TGTGCATTAC GTAATTTTTC TGTGATTTTT ACAAAGATAC GGAAGTCATC 792 0 

AAAGATGGCA TCCAATTTCT TGATAACATC AAGGTCAACC AAGTCGCCAC TTGGGTTAAA 7 980 

TGCTTGAAGA GAGTGTGAGA GCAAGAATTC ATCTGGAAGA ACATTTGCCT TGATTTCAGG 804 0 

AGCATTCAAG ATTTGACGAA GTTGCAATTG GGC AC GAG AT GAACCAAGCG TACCGTAAGA 8100 

AGCACCTGTA ATCATGATTG GTTTGTTCAA AAGTGGGTAA ATACCATAAG ACAACCAAGC 8160 

AAGAGCGCTC AT C AAAAC AG CTGGAATAGA GTG AT CAT AC T C AGGAGT AC C G AT AAT AAC 822 0 

GCCATCTGCC TCTTCGATTT TAG C AG C AAT TTCCAATATT TCAGCAGGTA CTTGCTTGTC 82 80 

AGCTGGTTTG TTGAAGACAG GAATGGCCTT GATTTCAACA AGTTCAATTT CAGCTTTGTC 834 0 

AGTAAAGTGT TTTTGCATGT ATTGAAGCAA TTGACGGTTT GTAGAACGTT TTGAATTTGT 84 00 

TCCAACAATA GCAATAAGTT TTAACATGAG ATTTCCTTTC TCTTTTTACA TAATACAATT 84 60 

TTAAAATTCC ATT G AAAC AG TTGTCTCTAT AGAGTAGGAA TTCCTGAAGA ACAGCTTAGG 852 0 

TGGCCTTCTT TATCGATGAG GATGACTTCG ATGCCCTCCA AACTTTCGAC TTGCCAGAGG 8580 

ATAGAAGCAG GTCTTTCTCC AAAGAGTCGA GTCGTCCAGA TTTCGCCATC GACTGATTTA 8640 

TCAGAGATGA TTGTTAGACT CGCTAGTTCC GTTTCAACAG GATATCCTGT TTGACTGTCA 8700 
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AAAATGTGAT GGTAATCTTG TCCATCGACG GTCAGGTGAC GTTCATAAAT GCCTGAAGTC 87 60 

ACGACAGATT TATTGACAAC AGGGATGGTC ATTAAATGAT TTCCCCTAGG ATTGGCTGGG 8820 

TCTTGAATCC CGATTTGCCA TGGGTTATCC CCTCTTGCCT GATTTTTTCC AATGGTCAGG 8880 

ATATTCCCTC CCAGATTGAT CAAGGCAGAA GTCACCCCCT CTTTCCTAAG AAATTGGGCA 8940 

ACCTTATCCG CACTGTATCC TTTGGCTAAA CAACCTAGAT CGATCTTCAT TCCTTTCTGT 9000 

TTTAAAAACA CAGTAGAAGT AGAAGAATCT AACTCGATAC CATGAGGATT GATTAGAGGC 9060 

AGCACCGATT CAATTTCTTG AGGCTGGGCG ACCTTGGCAT CTGAAAAACC GATACGCCAG 9120 

GTTTGAATTA AGGGACCAAT GCTGATATTG AGGTGGCTAG AGAGCGCTAG GCTATGCTCT 9180 

AACCCAAGTG AAATCAGCTC AAACAGGTCT GGATGAACCG TGACGGGGGC TATTCCTGCT 9240 

TGATAATTGA TTTCCATCAA CTCAGATTCT TGACTATTGG CGTTGAAGCG GTATTCAAGT 9300 

TCTTTGAGCA AGTCAAAGGA TTTTTGGAGA AAGATATCGG CTTGCTCATC CACTAATGAA 93 60 

ATAGTGATAG TAGTCCCCAT TAG C C GTT C A GAATGTGAAC GAAGAGTCAA GCTACCAACT 942 0 

CCTTTCTCTT ATAGAAAATA AGTTGTAATA TCAAATAATC ATCTAAATTG AAGCCCTTAC 9480 

ATTTCATTTT CAT GTT ATT A TAATACCATA AAGTTAGAAT TTTCACAAAC AAAATTTGGA 9540 

AAAAGTCAAG AAATATGCTC ATAAAATTCA TCAGGCTTGA AAACAGGATA AATGGGGAAT 9 600 

TATTTTTGAT AAAAAATGCT GAAATAATAG TACCCCCCTT GTAAACGCTA ACGGTAAATG 9 6 60 

GTATACTAGT AAGGTAAATT TAGAATGAAG GCAGGAAATT TTTATGAGTA AAATCGTTGT 97 2 0 

AGTCGGTGCT AACCACGCTG GTACAGCATG TATCAATACC ATGTTGGATA AT T T TGG AAA 9780 

TGAGAACGAA ATTGTTGTAT TTGACCAAAA CTCTAACATC TCTTTCCTAG GATGTGGAAT 9 840 

GGCTCTTTGG ATTGGTGAAC AAATTGACGG TGCTGAAGGC TTGTTCTATT CTGATAAAGA 9900 

AAAATTGGAA GCTAAAGGTG CTAAAGTTTA CAT G AACT C A CCTGTTCTTT CAATCGACTA 9960 

T G AT AAC AAA GTAGTTACAG CGGAAGTTGA AGGAAAAGAG CACAAAGAAT C AT ACGAAA A 10020 

ATTGATTTTC GCTACAGGCT CTACACCAAT CTTGCCACCA ATCGAAGGTG TTGAAATTGT 1008 0 

TAAAGGAAAC CGCGAATTTA AAGCAACTCT TGAAAACGTA CAATTCGTGA AATTGTACCA 10140 

AAATGCTGAA GAAGTTATCA ATAAACTTTC TGACAAGAGC CAACACCTCG ACCGTATCGC 10200 

CGTTGTTGGT GGTGGTTACA TCGGTGTTGA ACTTGCTGAA GCCTTTGAAC GTCTTGGAAA 10260 

AGAAGTTGTC CTTGTTGATA TCGTTGATAC TGTCTTGAAC GGTTACTATG ACAAAGACTT 1032 0 

CACACAAATG ATGGCGAAGA ACTTGGAAGA T C AC AAC AT C CGCTTGGCTC TAGGTCAAAC 103 80 

TGTTAAAGCA ATCGAAGGTG ACGGTAAAGT TGAACGCTTG ATTACTGACA AAGAAAGCTT 10440 
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TGACGTGGAT ATGGTTATCC TTGCAGTTGG TTTCCGTCCA AACACAGCCC TTGCAGGTGG 10500 

TAAGATCGAA CTCTTCCGCA ACGGTGCCTT CCTTGTAGAC AAGAAACAAG AAACATCTAT 10560 

CCCAGACGTT TACGCTGTTG GTGACTGTGC GACTGTTTAT GACAATGCTC GTAAAGATAC 10 62 0 

AAGCTATATC GCTCTTGCTT CAAATGCTGT GCGCACTGGT AACGTTGGT 10669 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7542 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 58: 

CGCGCTAATA GATACTTTAT GATAGAATAA AGAACAAGAT TGACAAGTAA GAGGAAACAT 6 0 

TATGCAAAAT CAAACACTCA TGCAATACTT TGAATGGTAT CTGCCCCACG ACGGTCAACA 12 0 

CTGGACGCGT CTGGCTGAAA ATGCTCCACA CCTAGCTCAT CTGGGGATCA GTCACGTCTG 180 

GATGCCACCA GCCTTCAAGG CAACCAACGA AAAAGATGTC GGCTATGGGG TCTATGACTT 24 0 

ATTTGACTTA GGAGAGTTCA ACCAAAAAGG GACTGTCCGC ACCAAGTATG GTTTCAAAGA 3 00 

AG AC TAT CT T CAAGCCATTC AAGCCCTTAA AGCACAGGGA ATTCAACCTA TGGCCGATGT 3 60 

AGTTCTCAAC CACAAGGCTG CTGCCGATCA CAGGGAAGCC TTTCAGGTTA TCGAAGTTGA 42 0 

T C CTGT AG AC CGTACAGTTG AAC TTGG AG A ACCCTTCACC ATCAATGGCT GGACTAGTTT 4 80 

TACCTTCGAT GGTCGCCAAG ATACCTATAA TGGCTTCCAC TGGCATTGGT ACCACTTCAC 54 0 

CGGTACAGAC TACGATGCCA AACGCAGTAA ATCTGGGATT TATCTGATCC AAGGGGACAA 6 00 

CAAGGGCTGG GCCAACGAGG AATTGGTCGA TAACGAAAAC GGAAACTACG ACTACCTCAT 6 60 

GTATGCCGAC CTAGACTTTA AACATCCTGA AGTCATCCAA AACATCTATG ACTGGGCTGA 720 

TTGGTTCATG GAAACGACTG GTGTAGCTGG TTTCCGTTTG GATGCCGTTA AG CAT AT T G A 7 80 

CTCTTTCTTT ATGCGCAACT TCATCCGCGA TATGAAGGAA AAATACGGTG ACGATTTCTA 84 0 

TGTTTTTGGT GAATTTTGGA AC C C AG AC AA GGAAGCCAAT CTGGACTATC TCGAAAAAAC 900 

GGAAGAACAC TTTGACCTTG TCGATGTTCG TCTCCACCAG AATCTCTTTG AAGCCAGTCA 9 60 

AGCTGGCGCA AACTATGACC TTCGTGGCAT TTTCACAGAT AGCCTGGTTG AACTCAAGCC 102 0 

TGACAAGGCT GTGACTTTTG TCGACAACCA CG AT AC C C AA CGAGGACAAG CCCTTGAGTC 1080 

TACCGTTGAA GAATGGTTCA AG C C AG C AG C CTATGCCCTC ATTTTGTTAC GCCAAGACGG 1140 

CCTTCCATGT GTCTTTTACG GAGACTACTA TGGGATTTCA GGGCAGTATG CTCAAGAAGA 12 00 
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TTTCAAAGAA ATCCTTGACC GCCTCCTAGC CAT C C G AAAA GATTTGGCCT AT G G AG AAC A 12 60 

AAATG AC T AC TTTGACCATG CTAACTGTAT CGGTTGGGTA CGTTCAGGTG CTGAAAATCA 13 2 0 

ATCCCCAATC GCAGTCCTTA TCTCAAATGA CCAAGAAAAC AGCAAGTCAA TGTTTGTCGG 13 80 

TCAAGAATGG ACTAATCAAA CCTTTGTAGA TTTACTTGGT AACCACCAAG GTCAAGTTAC 1440 

AATTGATGAG G AAGGTT AT G GACAATTCCC TGTCTCAGCT AGATCCGTAA GTGTCTGGGC 1500 

AGTCAATACC ATCTAATAGC TCATAATAAC CAAGCTAGGT CCAAGCGGAT TTGGCTTTTT 1560 

TGTATTCACA AAAAGACCTA CCCAAATGGA TAGATCTTTA CTTGATTACA ATTTACCTGC 162 0 

TACTGCATCC AACAATTCTT GGATCTTAGG TTGGTTGCTT CCTCCTGCCA TGGCCATATC 1680 

TGGTTTACCA CCACCACGTC CATCGATGAT TGGTGCTAAT TCTTTGACAA GGTTTCCTGC 1740 

ATGAAGGTCT TTTGTCTTGC TTGCTACAAG G AC AT TG AC T TTGTCACCGA TAGCGGCAAC 18 00 

TAGGACAAGA AGATCAGAGT AGTCTTTTTG TTTCCAGTTA TCTGCAAAAG TACGAAGGGC 1860 

ACCGGCATCG GATACAGACA CT T G AC TAG C AATGTAACGA TGACCGTTGA CTTCCTTAAC 19 2 0 

ATCTTTGAAG ATATCGCCTG CGGCTGCAGC TGCGGCTTTT TCTTTCAACT CAGCATTTTC 19 80 

TTTTTGAAGT TGACGAAGTT GTTCTTGAAG TCCTTCTACC TTGTGAGGTA CTTCCTTGAC 2 040 

TTGAGGTGCT TTCAAGGTTG CTGCGATAGC TTTAAGAGCA TCCTCTTGTT CACGATAGGC 2100 

TTCAAAGGCT TCCTTACCAG TCACTGCCAA GATACGGCGA GTTCCTGAAC CGATTCCTTC 2160 

TTCTTTGACA ATTTTGAAGA GACCAATCTC AGAAGTGTTG T C AAC AT GAG TACCACCACA 2 22 0 

AAGTTCAATA GAGTAGTCAC CGATAGTCAC GACACGAACT TCCTTGCCGT ATTTCTCACC 2280 

AAAGAGGGCC ATAGCTCCCA TTTCTTTAGC AGTGTCAATA TCCGTTTCAA CTGTCTTCAC 2340 

TTCAAGTGCT TCCCAAATTT TCTCGTTAAC TTGCTGTTCA ATCGCACGAA GTTCCTCAGC 2 4 00 

AGTTACTGCT TGGAAGTGGG TAAAGTCAAA GCGAAGGAAT TCAACTTCGT TAAGAGATCC 2460 

TGCCTGTGTT GCGTGGTTTC CAAGGATATT GTGAAGGGCA GCGTGAAGCA AATGAGTCGC 2 52 0 

AGTGTGGTTT TTCATGACAC GGTGACGGCG ATTGCTATCA ATTGCCAAGG TAT ATT CT TG 2 580 

GTTCAAGGCA AGCGGTGCAA GGACTTCAAC TGTATGAAGG G CT TG AC C AT TTGGGGCTTT 2 640 

CTGAACATTG GTCACAGTAG CCACAACCTT ACCTGACTCA TCCAAGATTT GTCCGTAGTC 27 00 

AGCTACCTGT CCACCCATTT CAGCATAAAA TGACGTTTCC GCAAAGATAA GAGAGGCAGT 27 60 

TCCTTCTGAA ACAGCTCCTA CTTCTGCATT GTCAGCAACG ATAGCTACCA ATTTAGAAGA 2 82 0 

CAATTGGCTA GCATTGTAGT TGAAGACACT TTCTACAGTG ATGTTTTGAA GAGTTTCATT 2 880 

TTGCATACCC ATTGAGCCAC CCTTGACAGC TGACGCACGC GCGCGTTCTT GCTGTTCTTT 2 940 
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CATGGCTGCT TCAAAACCTT CACGGTCTAC AGTCATACCA GCTTCTTCAG CGATTTCTTC 3000 

AGTCAATTCA ACTGGGAACC C AT AAGT AT C ATAGAGTTTG AAGACATCTG AACCAGCGAT 30 60 

AACAGATTGA CCTTTTTCTT TCAAGTCTGC TACAATGCCT TGGGCAAAGT GTTGACCTGA 312 0 

GTGAAGGGTA CGGGCAAATG ATTCTTCTTC GCTCTTAACG ATTTTCTCAA TAAAGTCACG 3180 

TTTCTCAAGC ACTTCTGGGT AGTAGCTTTC CAT G AT TT T T CCAACAGTTG GAACCAATTT 3 2 40 

GTAAAGGAAA GGCTCGTTGA TACCCAATTT TTGACCATGC ATAGAAGCAC GACGGAGAAG 3300 

ACGACGAAGA ACATAACCAC GACCTTCATT TCCTGGAAGG GCACCATCAC CGATAGCAAA 3 360 

TGAAAGAGAA CGAATGTGGT CTGCGATAAC CTTGAAGCTC ATGTTGTCGC CATCTTGGTC 3 42 0 

ATAAACCTTA CCAGACAATT TCTCGACTTC ACGGATAATC GGCATGAAGA GGTCCGTTTC 34 80 

AAAGTTGGTC TTAGCCCCTT GGATAACGGC CACCAAACGC TCCAAACCAG CGCCCGTATC 3540 

AATGTTCTTA TGTGGCAATT CCTTGTATTC GCTACGAGGA ACAGCAGGGT CTGCGTTAAA 3 600 

T TGTG AC AAA ACGATGTTCC AGATTTCAAT ATAACGGTCG TTTTCAATAT CTTCTGCAAG 3 660 

CAGGCGAAGA C CG AT AT T TT CTGGGTCAAA GGCTTCCCCA CGGTCAAAGA AGATTTCTGT 3 72 0 

ATCTGGTCCA GAAGGTCCCG CACCGATTTC CCAGAAGTTG TCCTCAATTG GAATCAAGTG 3 7 80 

ACTTGGATCC ACTCCCACTT CAATCCAGCG GTTGTAAGAA TCTTTATCGT CTGGATAGTA 3 84 0 

GGTCATGTAA AGTTTTTCAG CAGGGAAATC AAACCATTCA GGGCTTGTCA AAAGCTCATA 3 900 

AGCCCAAGTG ATAGCTTCGT CACGGAAGTA ATCCCCGATA GAGAAGTTCC CCAGCATTTC 3 9 60 

AAACATGGTA TGGTGACGCG CGGTCTTCCC TACGTTTTCG ATGTCGTTGG T ACGG AT AG C 4 020 

CTTTTGGGCA TTGGTAATAC GTGGATTTTC AGGGATAATG GTCCCGTCAA AGTATTTCTT 4 080 

AAGGGTTGCT ACCCCAGAGT TGATCCACAA AAGAGTTGGG TCATTTACAG GAACCAAACT 414 0 

TACTGATGGT TCTACTGAGT GACCTTTGGT CGC C C AG AAA TCAAGCCACA TTTGGCGTAC 42 00 

TTGTGCACTA GATAGTTGTT TCATATTGTC TCCTTATTCA CTTGTTTAAT GTGATTGGCT 42 6 0 

TTCCAGCATT TCCACATAGT CAATCGCGAC ACAGAGGGAA ATGACTAGGT CTGCATAAGC 43 2 0 

GTCTTCAAGA ACCGTTACGG TATAGGTAGA AGTCAGATGG AAGAGTTCCT TCTTAATTTC 4 380 

CGCAATCAAC TGATCGCGAT CATCCAGCAA TTTGAAATTC AAATCCCAGA TATTGCCCTC 4440 

GATACGAAGA CCTAGATTAT CAAACTCATA CTTATCTCGC CAGAAGGTCA ACTTCTTACG 4500 

AATGACAAAA CTCGAGCCAT CCCGAAGCTG AATTTCAAAA CG AGGAAG C A AGGTCAAGAT 4560 

TTCTTTACTA ATCTCACTGA CTTGTTCACC AGCCGCATCA TAGATGGTAA AGGTTTTAGG 4620 

AATCTTAAAA AATGATCCCT C C AC C T GAT A GGCAATTTCT CCCCTGTCAT CCTTGATAGC 4 680 

GAAGCGTTCG CCTCCAAGAC GAAACTTTTG TTTGACAAGA AATGTTTTCA TCAACACCTC 4740 
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CAAAAATCAA AAGACAAGCT C AT ATC AC G A AGGGCGAAAA ACCGCGGTAC CACCTTCATT 4 800 

CAATGAACTT GTCATTCTCT TGTTCTTATG CAATTGTATG ATTGAGTAGC ATGACTTCCT 4860 

AG C TT AG AT G GCTCGCAGCA CCGCCATTTC TCTGGACTAA GACAAGTGAA AATCAATTCT 4 92 0 

CAACTTTCTT ATTATAACGT TTTTTTAAGC TTGCGTCAAC TGGAAATGAT CTCCGTTGAA 49 80 

TTAGACCAAT TCCCTACATC TCTGATTACT TTTTCAGGAT ATATTTTTTC TTACTGCCAT 504 0 

TTTTCTTTTT ATCCCAAATT TTCATATTAC TAAACACAGC TACTAGAATA TTTCCAAATA 5100 

TAAAGGTGCC TATCACCCAA TATATGGACT CAGTTGTTAG GTATTGTCGA TCCAAGCCAT 5160 

CCTTTAAATG GAATAGTATA GCAGTTTGGT TAACAATCAT AAAGGTTGGC CAGAAACTTT 522 0 

TTTTGAAAAA AGTAGACATT TTCATTATTT GTTGCCGCTT TCTGTAAGGT TAATACTCAA 52 80 

TAAAAATCAA AAAGCAAACT AGGAAGCTAG CCTCAAGCTG TACTTGAGTA CGGCAAGGCA 53 40 

ACGCTGACGT GGTTTGAAGA GTATAGGCTT AGTATACTAC TAG G C AAG C A AATAAACAAA 5400 

TAAACAACTA GAATAGAAAA AGATAGGGCT CTAAAAACTG ACTTCTATTC CTTAAAAACG 54 60 

AACCAGCTTG ACTGATTCGT CTTCTTACGT TTATCTCCTA C TT C C GAT AC ATTTTAAACT 552 0 

GTAGGAAGAG GTCGCTATAT TTCCCTGTCC ATTTATGGTC AAATTTCTCA TAAACTTCTA 5580 

GGTGTTTCAT GGTTTCAACA TCGGGATAGA AGGCCTTATC TTCCTTTGTT TCCTCTGGGA 5 64 0 

GCAATTCCTT CGCTGGTAGG TTTGGTGTTG AATAGCCGAC ATACTCCGCA TTTTGGAGAG 5700 

CATTTTCAGG TTTCAACATA AAGTTGATAA AGGCATAGGC TGAGTTTTGG TTTTTAACTG 57 60 

TTTTGGGAAT GACCATATTG TC AAAC C AAA GATTGCTGGC CTCTGTCGGT ACCACATAAC 5820 

GTAGATTTTC ATTTTTTTCT AACATTTGGC TGGCTTCACC AG AG AAGG T C ACGCCGATTG 5880 

CAACATTATT CTGAATCATA TAGCCCTTCA TCTCGTCCGC AACG AT AG C C TTGATATTTG 5940 

GAGTCAGTTT GTAGAGCTTA TCCACTGTCT CTTCCAACTG CTGCAGATCC TTGGAGTTGA 6 000 

GGCTGTAGCC GAGGGAATTG AGTCCTAGTC CCAGCACCTC ACGCGCCCCA TCAAAGAGCA 6060 

TGATAGAATT CTTATACTCC GGCTTCCAAA GGTCATCCCA ATGCTCAGGC GCTTCATCTA 612 0 

CCATGGTTTC GTTGTAGACA ATTCCTAAGG TTCCCCAGAA GTAAGGGATG GAGAATTTAT 6180 

TACCTGGGTC AAAGGACTGG TTGAGAAACT CTGGTCCGAT ATTTTCGATT CCTTCAATTT 6240 

TTGAATAATC AAGCGGAACC AAGAGGTCTT CGTCCTTCAT CTTGTTAATC ATGTATTCAC 6300 

TTGGAATGGC AATATCGTAG GTCGTTCCAC CCTGCTTTAT CTTAGTGTAC ATGGCTTCGT 63 60 

TGGAGTCAAA AGTCTCGTAC TGAACTTGAA TTCCTGTTTC TTCTGTAAAC TGAGTCAAGA 642 0 

GTTCAGGATC GATATAGTCT CCCCAGTTAT AGATAACCAA TTTTTGACTA TC T CG ACT AT 64 80 
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TGATTTTACT AT C T AAATGA GTCGCAATTC CCCACAAGAC AAGGATAATC GCTGCAATTC 654 0 

CTGCTAAAAA TGAATAGATT TTTTTCATGC TTGCTCCTCC TTCTCACGAG AGATAAAGTA 6600 

ATAACCTACA ACTAGGATAA TACTAAAGAG AAAG AC TAG A GCAGACAGGG CATTGATTTC 6660 

TAAGGAAATC CCCTTGCGAG CACGAGAGTA AATCTCGACT GATAGGGTTG AAAAGCCATT 67 2 0 

TC CTGTT AC A AAGAAGGTCA CGGCAAAGTC ATCTAACGAA TAGGTGAAGG CCATGAAATA 67 80 

ACCAGTAATG ATAGACGGAG TCAGGTAAGG AAGCATGATT TCCTTGAACA TCTGAAATTG 6 84 0 

ACTAGCTCCC AAGTCATAGG CCGCATGAAT CATGTCGCCA TTCATTTCCT TGAGTCGAGG 6900 

CAAGACCATC AAGACCACGA TAGGAATGGA GAAGGCCACG T G ACT AG AT A GAACGGTCAA 6960 

AAAGCCAAGT GAAAACTTGA GTTGGGTAAA GAGAATCAAG AAGCTAGCAC CAATCATAAC 7 02 0 

GTCAGGCGCA ACCATGAGGA TATTATTGAG TGATAGAAAG GCTTCTTGGT ATTTCTTACG 7080 

AGACTGGTAG ATGTAAATGG CACCAAAAGT CCCGATAATG GTCGCTATCA AGGCTGATAG 7140 

GAAGGCCAAG AAAAATGTCT GAGCCAAAAT CAGCATGAGT CTCCCATCTC CAAACATGGT 72 00 

TTCAAAGTGA GTCCAGCTAA AACCTGTAAA GCTATTCATA TCATCACCAG CATTAAAGGC 72 60 

AT AG C C AATC AAGTAAAAGA TAGGCAGGTA GAGGACCAGA AAGACCAGTC C C AG AT AAAG 732 0 

GTTGGCAAAT TTTTTCATCG TTCTCTCCTT TCCTTAGTCA CCCACATGGT GATGAACATG 73 8 0 

GTCAGGATGA GAAT C AC AC C GATGGTTGAA CCCATACCAT AGTTGTCATT GGTTAGAAAA 744 0 

TTCTGCTCAA TAGCCGTCCC CAAGGTGATA ACGCGTTCCC ACCAATCAAA CGGGTCAGCA 7 500 

TGAAGAGACT CAAACTTGGG AT AAAG AC CG ACTGAACCCC GG 7542 



(2) INFORMATION FOR SEQ ID NO: 59: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
AAAACCAAAT TCCGGTATTT TAACCTATGC TGTAAATACC ATGAAGTCTG T C ATG AC AG A 6 0 

TCAGGTCTAT AACATTAAGG TTGAGACAGA AAATGGAAAT TATGTTGGTG AAGCTAGCCA 12 0 

TGTTTTGGTC CTTTTGACAA ATTACTTCGC TGATAAGAAA ATCTTTGAAG AAAACAAGGA 180 

CGGCTATGCC AACATTTTGA TTCTGAAAGA TGCCTCTATA TTCTCCAAAT TATCCGTCAT 24 0 

TCCTGATTTA TTAAAAGGGG ATGTTGTCGC AAATGATAAT ATCGAGTATA TCAAAGCGCG 3 00 

TAATATTAAA ATCTCTTCAG ATAGTGAATT GGAGTCAGAT GTTGACGGAG ATAAATCAGA 3 60 
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TAACCTACCT GTAGAAATCA AAGTCCTAGC TCAGCGAGTA GAAGTATTTT CAAAACCGAA 42 0 

AGAGGATTAG TATATAGAGA AAGCCTT TTT TAAGGCTTTT TGTATACTTT AAAAGATAGT 480 

TCCTTTAACA ACGGACATTC CTTGCAAATA GTTTTACAAA AATAGTATAC TGGATTCATT 54 0 

GAGTTTGAAA ACGTTTGCGT AAAATTTGAA TGAATACTTT AGGAGACAAA TTGATGGAAT 600 

TGAGTGCTAT TTACCATAGG CCTGAGTCGG AGTATGACTA TCTTTATAAG GATAAGAAAC 660 

TCCATATTCG AATTCGAACT AAGAAAGGGG ACATTGAAAG CATCAACTTG CACTATGGGG 72 0 

ACCCTTTTAT CTTTATGGAG GAGTTTTATC AGGATACAAA AGAAATGGTC AAGATAACTT 7 80 

CTGGTACCTT ATTTGACCAT TGGCAGGTTG AAGTGTCAGT TGACTTTGCA CGTATCCAGT 840 

ATCTCTTTGA GCTCAGAGAT ACAGAAGGTC AAAATATTTT GTATGGCGAT AAAGGGTGTG 900 

TGGAAAATTC TCTAGAAAAT CTTCATGCAA TTGGGAATGG ATTTAAGTTG CCTTAGCTTC 960 

ATGAG ATT G A TGCCTGCAAG gTTCCTGACT GGGTTTCAAA TACGGTATGG TATCAGATAT 1020 

TTCCTGAAAG ATTTGCCAAT GGCAATGCTC TATTAAACCC AGAAGGGACT TTAGACTGGG 10 80 

ATTCATCTGT CACACCTAAG AGCGATGATT TCTTTGGTGG T GAT T T AC AG GGGATTATTG 1140 

ATCATATGAA TTACTTGCAA GACTTGGGTA TTACTGGACT ATATCTTTGT CCCATCTTTG 1200 

AATCTACAAG CAATCACAAG TACAATACGA CAGATTACTT TGAAATTGAC CGTCATTTTG 12 60 

GAGACAAGGA GACCTTTCGG GAACTGGTGG ATCAAGCGCA TCATCGTGGC ATGAAAGTCA 1320 

TGCTGGATGC GGTATTTAAT CATATTGGTT CGCAATCTCT TCAATGGAAA AATGTCGTCA 13 80 

AAAATGGTGA ACAGTCTGCT TATAAGGATT GGTT C CAT AT TCAACAATTC CCAGTGACAA 1440 

CTGAAAAGCT AGTTAATAAG AG AG ACT T AC CCTATCATGT TTTTGGTTTC GAGGACTATA 1500 

TGCCTAAGCT AAAT AC AG C C AATCCAGAGG TCAAGAATTA TCTTTTAAAG GTTGCGACTT 1560 

ATTGGATTGA AGAGTTTAAT ATCGATGCTT GGCGTTTGGA TGTGGCTAAT GAGATTGACC 162 0 

ATCAGTTCTG GAAGGATTTT CGTAAGGCAG TTTTAGCTAA AAATCCTGAT CTTTATATCC 1680 

TAGGAGAAGT CTGGCATACA TCTCAGCCTT GGCTAAATGG AGATGAGTTC CATGCCGTCA 1740 

TGAATTATCC TTTATCTGAT AGTATCAAGG ACTATTTCTT ACGAGGAATT AAGAAGACAG 1800 

ACCAGTTCAT CGATGAAATC AATGGAGAGT CTATGTATTA CAAGCAGCAG ATTTCAGAGG 1860 

TCATGTTTAA TCTCTTGGAT T C ACATG AT A CAGAGCGAAT CCTGTGGACG GCCAATGAAG 192 0 

ATGTTCAACT GGTTAAATCA GCCTTAGCCT TTCTCTTTTT ACAAAAAGGA ACACCGTGCA 1980 

TTTATTACGG AACCGAGCTA GCCTTGACTG GAGGACCAGA TCCAGATTGT CGTCGTTGTA 2 040 

TGCCTTGGGA ACGTGTATCA AGTGACAATG ATATGCTGAA CTTTATGAAG AGGCTGATTA 2100 
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AAATTCGGAA ATACGCGTCA GTAATCATTT CGCATGGCAA GTATAGCCTT CAAGAAATCA 2160 

ACTCTGATCT AGTAGCTCTG GAATGGAAAT ACGAAGGACG GATCCTCAAA GCAATATTCA 222 0 

ACCAATCAAC AGAAGATTAT CTTTTAGAGA AAGAAGCAGT AG C AC TAG C A AGCAATTGCC 2280 

AAGAATTGGA TAATCAGCTT GTCATCTCTC C AG AT GG AT T TATGATTTTC TAAAAACTAG 2340 

TTGATGAAGA TTATGGTACA TTTCATACCT TATATAGTAT AATAAGGCTA GTTACTAAAC 2400 

TTGTAAAGGA GAACTTAAAT GAATTGTAGA GGACATGAAA CAAGACAAAG AATTGTTAGA 2 4 60 

GATTTTGAAG TTCAGCCTAA AG C AC AT AT T AAGCTGTTAG CAAATCAACA AAAACATAGT 2 520 

GATGCAGGAG CAACTATTGA AGATGAATAT TATGTATTTA TCGCTGAGAG TAAAATTGAT 2580 

GGCAAGAAGG AAGTTATTCA GTGTTGCATG GGTGCGGCAA GGGATTTTTT AGAACTAATT 2 640 

AATCACAAAG GGCTACCTCT TTTTAATCCG CTTGTAGGTG ATTCTCATGT AAATAATAGA 2700 

CAAGAATATG ACAATACAGG GAGTGGAAAT TTATAACCTG AAAAGTGGAA TGAAACTGCA 2 7 60 

AAGCAGCTTT ATAATGCTAT AATGTGGTTG ATTATTTTAT GGAATGCTAA GCCGGATACA 2 82 0 

CCTTTATTTA ATTTTAAAGA CGAAGTAATT AAGTATAAAA CATATGAGCC TTTTGAAAGC 2880 

AGTATAAAAA GAGTAAATAC TACTATAAAG AATGGTAGTA AAGGGAAAAC TCTGACTGAG 294 0 

AT GAT T AATG GCTACAGAGC GGATAACGAT ATT AG AG AT G AAATTTGTAA CTTTAATATT 3 000 

CTGAAAAATA AAATTCGTGA TATGAAAAAC CAACAAGGAA ATACAATGGA ATCTTACTTT 3 060 

TAGTTATTGT TGAATTTTGG GT AT T C TATA AAATATCCTA ATTGAGATTT AAATAGTAGA 312 0 

CTATACAATA TAGTTAAAAT ATCAGTAAAA ACAACACTTT ATTGAGGTAT TGGATACGCT 3180 

TTGCTAATAG CCTAATAATC ACATGTGGAG TGTTGCTACA ACGAAAAAGG TGATAATCCT 3240 

TGATTTCAAG CTATTTTATA AGCATTTTGT CTTTGTAGAT AAAGGCAATT TTGACAATAA 3 3 00 

AAATCCTAAA AGGTGAATCG T T AT AG ATGT ATTTGTAGAT ATCGTTTGCG CATCGAAAAA 3 3 60 

ATTAATACAA GAATAAATAT TTATAGCTCT TTAGGTGACT TTTATAGAAG T AAAG T T TAG 342 0 

GATAGAAAAA CAAGAAATAA CGCACCATTT TTGGTGCGTT ATGCTTTTTT ATGCTATAAT 348 0 

GGATTTATAA AAATAAAGGA GTTTGCTATG ATTGGAAAGA ACATAAAATC CTTGCGTAAA 3 54 0 

ACACATGACT TAACACAACT CGAATTTGCA CGG AT TGTAG GTATTTCACG AAATAGTCTG 3 600 

AGTCGTTATG AAAATGGAAC GAGTTCAGTC TCTACCGAAT TAATAGACAT CATTTGTCAG 3 6 60 

AAGTTTAATG TAT CTT ATGT CGATATTGTA GGAGAAGATA AAATGCTCAA TCCTGTTGAA 37 2 0 

GATTATGAAT TGACTTTAAA AATTGAAATT GTGAAAGAAA GAGGTGCTAA TCTATTATCT 37 80 

CGACTCTATC GTTATCAAGA TAGTCAGGGA ATTAGCATTG ATG AT GAG T C TAATCCTTGG 3 840 

ATTTTAATGA GTGATGATCT ATCTGATTTG ATTCATACGA AT AT C T AT CT AGTAGAAACT 3 900 
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T TTG AT G AAA TAGAGAGATA TAGTGGCTAT TTGGATGGAA TTGAACGTAT GT TAG AG AT A 3 9 60 

TCTGAAAAAC GGATGGTGGC CTAATGGAAA TCCAAGATTA TACTGATAGT GAATTCAAAC 402 0 

ATGCTTTAGC AAGGAATCTT CGTTCACTGA CAAGAGGAAA AAAGTCCAGT AAGCAACCTA 4 080 

TAGCGATTTT GCTTGGAGGG CAAAGTGGTG CCGGTAAGAC TACAATTCAT CGTATTAAAC 4140 

AGAAAGAATT TCAAGGAAAT ATTGTTATCA TAGATGGTGA TAGTTTTCGT TCTCAGCATC 42 00 

C AC AC T ATT T AGAACTGCAG CAAGAATATG GCAAAGACAG TGTAGAATAT ACCAAAGATT 42 60 

TTGCAGGAAA AATGGTAGAG TCTTTAGTAA CAAAATTGAG TAGTTTGAGA TACAATCTTT 43 2 0 

TGATAGAGGG AACTTTACGA ACAGTTGATG TTCCAAAGAA AACAGCACAA CTCTTGAAAA 43 8 0 

ATAAGGGATA TGAAGTACAA TTGGCCTTAA TTGCGACAAA GCCTGAATTG TCGTATCTAA 4440 

GTACTCTTAT CCGTTATGAA GAACTGTACA TTATCAATCC AAATCAAGCA CGCGCAACTC 4 500 

CAAAAGAACA TCATGATTTC ATTGTAAATC ATCTAGTTGA TAACACACGA AAATTGGAAG 4 5 60 

AACTAGCTAT CTTTGAAAGA ATTCAAATTT ACCAACGAGA TAGAAGTTGT GTATATGATT 4 62 0 

CAAAAGAAAA TACAACTTCA GCAGCAGATG TTCTTCAAGA GTTACTCTTT GGGGAGTGGA 4 6 80 

GTCAGGTAGA GAAGGAGATG TTGCAGGTGG GGGAAAAGAG ACTTAATGAA TT AC T T G AAA 4 740 

AATAAACAAT TGATATTTTT AGGAGAATAG AAATGAGAGG GTTTAATAAC AAGATAAAGT 4 800 

CTGTTTATCA AGAACTAACA AATTCCAAAG AGAAATTCGG TAGCTTTCAC AAGACTTTAA 4 860 

TTCATTTGCA TACACCTGTT TCTTATGATT ACAAGCTATT TTCTAATTGG ACTGCAACGA 4 92 0 

AATATAGAAA AATTACTGAA GATGAACTAT ATGATATATT TTTTGAAAAT AAGAAAATAA 4980 

AAGTTGATAA GACAATTTTT TTTAGTAATT TTGATAAGGT TGTTTTTTCT AG TT C AAAAG 5 04 0 

AATATATTAG TTTTCTTATG TTAGCAGAGG C AAT C AT AAA AAATGGAATA GAAATAGTTG 510 0 

TAGTAACTGA TCATAATACT ACCAAAGGTA TTAAAAAGTT ACAAATGGCA GTCTCAATCA 5160 

TAATGAAAAA TTATCCGATT TATGATATAC ATCCTCATAT TTTACATGGA GTAGAAATTA 52 2 0 

GTGCAGCAGA TAAATTGCAT ATTGTATGTA TATATGATTA TGAACAAGAA TCATGGGTTA 52 80 

ATCAATGGTT AAGTGAAAAT AT T AT AAGTG AGAAAGATGG AAGTTATCAA CATTCACTGA 5340 

C TAT AAT G AA GGATTTCAAT AATCAAAAAA TAGTTAACTA TATTGCTCAT TTCAATAGTT 54 00 

AT G AC AT T T T GAAAAAAGGT TCTCACTTAT CAGGTGCATA TAAACGAAAA ATTTTTTCTA 54 60 

AAGAAAATAC ACGATTTTGG AGTTTAATAT TAACTCGAAA GAATCTTCGC AACAACTTGA 5520 

TATTCTCTAT AAAGAAGTTG GTGTATTAAG TTTGGGACAA AAAGTTGTAG CCATGCTTGA 5580 

TTTTTTATTA GCATATAGTG ATTATTCTAA AGACTTCAGA CCATTGATTA TTGATCAGCC 5 64 0 
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TGAAGACAAT CTAGACAATC GTTATATTTA CAGGCATTTA GTTCAGCAGT TTAGAGATGT 57 00 

GAAAGCTCAA CGTCAAATTA TTTTAGCAAC ACATAATGCT ACAATTGTAA CAAATTCTAT 57 6 0 

GACAGATCAA GTTGTTATTA TGGAGTCAGA TGGAGTTAAC GGATGGATTG AATCACAGGG 582 0 

ATATGTTAGT GAAAAATATA TAAAAAATCA TAT CAT C AAT CAATTAGAGG GAGGAAAAGA 5880 

TTCCTTCAAG CATAAAATGT C TAT AT AT G A GACGGCTTTA T C AG AGT AG A GTCAGAAAAA 5 94 0 

GTAGGTTAGA AATTTAGCCT ACTTTTTTCT TTGTCCGACA GGCATAGTGT ACATCTGAGG 6000 

TCCAAGTCCT CTGTGGATAT TTGCTGCAGA TGAAACCAAT AGCGACTCCT AAGCCTGAAT 60 60 

ATCGTGAGGT AGGGGGGATA GGAAGGAATT AGCGAAATCA AGGTTCTACA AACAGAATCG 612 0 

TGACTTGAAG C CAT AT AT AG CGGATGAGGA ACTCTAAAAT CCAAATAGGT GTCGTAACCT 6180 

ATATACGTAA ATTACGAGAG TAAACTAGGA AAGATGTACG GCTTATTCCG TGAGCGTTTA 624 0 

GGACGTAGTA CAACGAATCA TGGGAGTCAG CTGAACACAT AGTATTGAAG AAATTTCTGT 63 00 

AATGGAAATG GAGCGAAGAA GTGAACAATT AAATGAATAC CTCTCTAATT AAATTTGTCA 63 60 

ATT C T AAT T C CTGGTATGAA AAGACAGTGA CCTGAAAATG TAAACGATGG GAGCTGATCA 642 0 

TAAATATAGG ACGGTACATG C AGT GGTGTT AGAGATTAGT CCTTACTTGA TTTGTGATAA 64 80 

CTTCCCCAAA TTTCTTCTGC TATACTTTTC TCAACTTTTA AAAATCCAAC TAAGAATTTT 6540 

ACCTGGGGGT TTGGGGGCGG AGCACTAAGT TATCTTATCG TTAGCTGTCA AAACTGGTAG 6600 

GT T T TG AT AG GCTGGCGATA TGATTTTTGG GATATTGTGG ACACAATATC TGAGCTCGCA 66 60 

AAGCCTTACA AGAATGAAAA TCAGTTGTTG GAAAAGTGTA CTGACATTGT ATGGTAGCTC 672 0 

ACATTGTCAG TACAAGTATT TTGGAAAGGA AGT AG C AGT A TGAAACGAGA TGTGCGTGAT 6780 

ATTCGGAAAC AATTTCGTTT AACAGAAGCA GAAGAAAAGC AAATT C TAG C TTTGATGAGA 684 0 

GAGCGGGGAG AGACTAATTT CTCTGATTTT CTTCGTAAAA GTTTACTTTC CTCTGATTTA 6900 

CAAAAACAGA TGGAGACATG GTTTGCCCTC TGGCAATCCC AAAAACTAGA ACAAATCAGT 6960 

CGTGACGTTC ATGAAGTTTT AATCTTGGCA CAGTCAGAAC GTCAAGTCAC CCAAGAGCAT 702 0 

GTATCTATTC TCTTAACGTG CGTGCAGGAA TTGATTCAAG AGGTTGCAAA CACCATACCC 7 080 

CTCAGTAAAG AATTTCGTGA GAAGTACATG AGGTAAGCAC ATGGAACATC GTTACCGAAC 7140 

CAATCTCAAG AAAGTGTTTT TGTCTGATAG TGAGTTGAAC CAACTAAATA TAAATATCGA 72 00 

TCAAAGTGGT TGTAAATCCT TTTCTGAATA TGCGAGACGA ACTCTACTCG ATCCTGGTAT 72 6 0 

GAATTTTATC ACG AT TG AC A CAAACGGTTA CCAAGATTTA GTGTTTGAGT TAAAGAGGAT 7320 

TGGCAATAAT AT C AAC C AG A TTGCTCGAAG TGTTAATCAA TCTCAGTTAA TTTCTGGTGA 7 380 

AGAATTGCAG GAGTTGAAAA AAGGAATTGG TGAATTGATA AAAGAAGTTG ATAAGGAATT 7440 
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TAATCTGCAA GCGCAGAAGC TAAAGGAGTT C C ATGGT C AT CACTAAACAC TTTGCCATTC 7 500 

ACGGAAAGAG TTACCGCAGA AAGCTTATCA AGTACATTCT CAATCCTGAG AAAACCAATA 7 5 60 

ATCTTGCCTT GGTGTCGGAC TATGGCATGA AGAATTTTCT GGACTTTCCT AGCTATGAGG 7 62 0 

AAATGGTGCA GATGTATCAT GAAAATTTCA TCAGCAACGA TACGCTTTAC GATTTTCGCC 7 680 

ACGACAGGAT GGAAGAAAAT CAACGAAAAA T AC AC G CT C A CCACATCATT CAGTCTTTCT 774 0 

CGCCAGAGGA TCATATCACT CCTGAACAAA TCAATCGGAT AGGTTATGAG ACTGTGAAGG 7 8 00 

AATTAACTGG TGGCAAATTT CGTTTTATCG TTGCGACCCA TGTTGATAAA GACCACCTGC 7 8 60 

ACAATCACAT CATTATCAAT TCAGTAGATA GCAATTCTGA CAAAAAGCTC AAGTGGGACT 7 92 0 

ACAAGGTGGA GCGAAATCTT CGCATGATTT CTGACCGTTT TTCTAAAATC GCAGGTGCTA 7980 

AAATCATTGA GAACCGCTAT TCTCACCAGC GGTATGAAGT CTATCGTAAG ACTAATCACA 804 0 

AGTATGAACT CAAGCAGCGA CTCTATTTTT TGATGGAACA TTCTAGGGAC TTTGAGGATT 8100 

TCAAAAAGAA TGCTCCGCTA CTACATGTGG AGATGGATTT CCGTCACAAG CATGCCACCT 816 0 

TTTTTATTAC GGACTCAACT ATGAAACAGG TGGTGCGTGG CAAGCAACTC AATCGCAAGC 822 0 

AGCCTTACAC AGAAGAATTT TTTAAGAACT ACTTTGCCAA AAGAGAAATA GAAAGTCTCA 8280 

TGGAATTTTT ATTGCTGAAA GTTGAGAATA TGGATGATTT ACTTCAGAAA GCAAAACTTT 8340 

TTGGACTAAC TATCAATCCT AAACAAAAGC ATGTTTCTTT TCAATTTGCA GGAGTGGAGG 84 00 

TAAAGGAGAC AGAGCTAGAC CAGAAAAATC TTTATGATGT AGAGTTTTTC CAAGATTATT 84 6 0 

TTAAAAATAG AAAAGATTGG CAAGCTCCAG AAACTGAGGA TTTCGTTCAA CTTTATCAAG 8 520 

AAGAAAAGTT ATCCAAAGAA AAAGAACTTC CAAGCGATGA GAAGTTCTGG GAGTCCTATC 8 580 

AAGAGTTCAA GAGTAACAGA GATGCCGTTC ATGAATTTGA GGTGGAGTTG TCACTCAATC 8 640 

AAATTGAAAA AGTAGTGGAT GATGGAATTT ACGTCAAGGT CAAGTTTGGT ATTCGTCAGG 87 00 

AGGGACTTAT CTTTGTGCCG AACATGCAGC TTGATATGGA AGAGGATAAG GTGAAGGTTT 87 60 

T C ATCAGGG A AACCAGCTCC TACTATGTCT ACCACAAAGA CGCTGCCGAG AAAAATTGTT 8820 

ATATGAAAGG TCGAACCTTA ATTAGACAGT TCAGCTATGA AAATCAAACC AT T C CAT T AC 88 8 0 

GCAGAAAAGC GACAGTCGAT ATGATTAAAG AG AAGATTG C GGAAGTGGAT GCTTTGATTG 894 0 

AACTGGAAGT AGAAAATCAA TCTTATGTCA CGATTAAAGA TGAGTTAGTG CATGAACTAG 9000 

CAGCGTCTGA ATTGAGAATC AATGAGTTGC AAGAACGAAT GTCAACCTTG AATCAAGTAG 9060 

CAGAATATCT ACTGGCTTCA GTTGAAAGTA AGCAAGAAAT GAAATTAAAT CTTTCAAAAC 9120 

TGAATATAAC TGAGAATATC AGTGCTAATA TTGTTGAGAA AAAATTGAAG AGCCTGGGGA 9180 
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ATCAACTGGA ATTGGAAAGG GGCAGGTATG AAAAGATGGT AGT 92 2 3 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

TCTGCTGGCT ACCATCATCT GACTTGGGCA AGACCAAAGT CTTAGTTACA ACTGTATTCT 60 

TCTCAGCATT TTCAATAACT GGCAATGCCG ACTGAAGCGT ATCTTTTTCT GTTTTTGTAG 12 0 

CTGGTCCAGT TTCTTTTTTC TGTCCGCAAC CAACCAGGAC AAAAAGGAAA G CT AG ACT AA 180 

CAAGAACTAT TTTTTTCATT TCTTTCTTCT TTCTTTTTGA AATTAAAATA GAATAAGACT 240 

GGGAAGTGCT CCCAGCCTTG ATGTTTATAG AGCTGCACGC AAACGTGCTT CTGCATTTTC 3 00 

TACATTACGG ACAGAGCGTG GTAGGAAGGC ACGAATATCG TCTTCCTTGT AGCCAACTTG 3 60 

CAGGCGTTTT TCATCTACAA GGATTGGGCT CTTTAAAATT CTCGGTGTTT CCATAATCAG 420 

ATTGAGAACT TCATTGACAC TCAAATCTTC AATATCCACT CCAAGGGCTT TGGCATAGCG 4 80 

ATTTTTAGAC GAAACGATGC TGGCTATTCC GTTATCTGTT TTGGTTAGAA TATCCAGTAA 540 

TTCTTCTCTC GTAATTCCTT CTTTACCAAG GTTTTGTTCT TTATAACTTA ACTGGTGGGC 600 

ATTGAGCCAG GTTTTTGCTT TTTTACAGCT AGT AC AAC TT GAGACTGTAT AAATTTTAAT 660 

CATGTACCTA CCCCTTTCGC TACATGTTAC TATCAGTTTA GTCTATTATA CCATAAAAAA 72 0 

CATCCGACTT GCGACCTATT TTTAATTTTT TTTGACTTTT TTCGTCATTT TCGTACTTTT 780 

TTCTTGACAA ACAACTAAAT GACTATCAAC TCTTTTGGAG CTAGGGTCAA TAATTCACAA 840 

CCTGTCTCTG TAATCAGGAT ATCATCCTCG ATACGAACGC CATATTTGCC TTCGATATAG 900 

ATACCTGGTT CAT CGGT C AA GGCCATACCT GTCTTAATAG TTTCTGTAGA AGT CTG ACT A 960 

AAGTAGGGTT CCTCATGGAT ATCCAGACCA ATACCGTGGC CAATGCCGTG AGTAAAGTAG 102 0 

TCACCATAAC CTGCCTCAAT GATAATATCA CGAGGGATTT TGTCAAAGTC ACGGAAACCT 1080 

AAGCCTGCCT TAGCTTGGTC AATCAAGGCT TGGTTAGCTT TTAGAACCGT ATTGTAAATC 1140 

TCTGCCTGCT CATCGCTAAC ATGCCCTAGA TAGATAGTCC GGGTCATATC ACTGACATAG 1200 

TGGTCATAGA GACAGCCGAA GTCCATGGTG ATGGCTTCTC CCAACTCCAC TGGTTTGTGC 12 60 

ATTGGATGGG CATGGGGTTT AGAAGAATTG ATACCGCTAG CTAGGATCGT AT C AAAAG AT 1320 

AAGCCAGATG CTCCCAACTC ACGCATGCGG AAATCAAGGA AGTTGGCAAT CTCAATTTCA 13 80 
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GTTTTTCCTG GTTTGATAAA GTCAAGCGCA TCGCGGAAAG CTTGGTCTGA GATAGAACAA 1440 

GCCTTGCGAA TCGCTGCAAT CTCTGCCTCA TCCTTAATCA TACGAAGACC TTCCACAAAC 1500 

TGAGTTTGTG GAAGCAAGTT CAAACCTGCA AAAGCTGCCT GCATACGGTG GTAATAAGAC 15 60 

ACTGAAATCT CATCTTCAAA ACCGATACGA GTCAAGCCCA TGTCCTTAAC AATTCCTGCA 162 0 

ATGACAGCCA ATT CATC ACG ATCAGCCACA ATCTCAAAAC CACTGGTTTC TTGCTTAGCT 1680 

GCG AT GAT AT AGCGAGAGTC TGT C ACT AAG ACCTGACGGT CACGACTGAT AAAGACTGTT 174 0 

CCGTTTGAGC CCCAAAAACC AGTCAAATAA TAGACGTTTT TAAGATTGTT GATGATGATA 1800 

CCATCTAGTT CTTTTTCTTG CATTTTAGCT AGAAATGCTT GTACGCGTTT ATTCATGATG 1860 

TAACTTTCCT TTCAAATAGT GTCCTGTATA GCTGGCTTCG TTGGCAGCTA CTTCTTCTGG 1920 

AGTTCCTGTT ACGATGATGG TTCCACCACC GACACCGCCC TCAGGTCCCA AGTCAATGAT 1980 

ATGGTCTGCC GTCTTGATAA CATCCAGATT GTGCTCGATG ACGAGGACTG TATTGCCATC 2040 

GTCTACAAAG CGAGCTAAAA CCTTGAGCAG GCGAGCAATG TCCTCTGTAT GAAGCCCTGT 2100 

CGTCGGCTCA TCCAGAATGT AGAAAG AT T T TCCTGTCGAT CGTTTGTGGA GTTCGCTAGC 2160 

TAACTTCATA CGTTGGGCTT CTCCCCCAGA AAGGGTGGTA GCTGGCTGTC CCAAGGTCAC 2220 

ATAGCCTAGC CCTACATCCT TGATGGTCTG GAGTTTGCGT TGAATTTTCG GAATGTGTTG 2280 

GAAAAATTCT ACCGCATCGT TGACCGTCAT ATCCAAGACC TGCGAAATAT TCTTTTCCTT 2340 

GTAGTGAACT TCTAGGGTTT C ACTGT T AT A GCGGGTTCCG TGGCAAACTT CACAAGCCAC 2400 

AT AAAC AT CT GGCAAGAAGT GCATCTCAAT CTTGATAATC CCGTCACCTG AGCAAGCTTC 2460 

ACAGCGACCT CCCTTGACGT TGAAACTGAA GCGCCCCTTC TTGTAGCCTC GAATCTTGGC 2520 

TTCATTTGTC TGAGCAAAAA GGTCACGTAT ATCGTCAAAA ACTCCTGTAT AGGTAGCTGG 2580 

GTTAGACCTC GGCGTCCGTC CGATAGGGCT CTGGTCAATA TCAATCAAAC GGTCGACATG 2640 

CTCAATCCCT GTAATAGTCT TAAACTTACC AGGTTTGTCT GAATTACGGT TGAGCTTCTG 2 7 00 

GGCAATGGCT TTTTTGAGAA TGCTGTTGAT TAGAGTCGAT TTCCCTGAAC CCGACACACC 2 7 60 

TGTCACTGCG ATAAATTTTC CTAGTGGAAA GCGAGCCGTG ACATTTTGCA AGTTGTTCTC 2 820 

ACGCGCTCCT ATCACTTCAA TAAAACGACC ATTTCCGACA CGGCGCTCTT CTGGTACTGG 2 880 

GATGACACGT TTGCCTGACA AGTACTGACC TGTGATAGAC TTGCTGTTGC GAGCCACTTG 2940 

CTTAGGTGTA CCTGCTGCAA CAATCTCACC ACCAAAAACA CCGGCACCAG GACCAACGTC 3000 

AATCAGATAA TCAGCCTCAC GCATGGTATC TTCGTCGTGT TCCACCACGA TAAGAGTATT 3060 

GCCCAAGTCA CGCATCTTTT TCAGACTGGC AATCAGGCGA TCATTGTCCC TCTGGTGAAG 312 0 
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AC CG AT TG AC GGCTCGTCTA GGATATAGAG GACACCTGAT AGGTTGGAAC CAATCTGGGT 318 0 

TGCCAAACGA ATGCGCTGAC TTTCCCCACC TGAAAGGGTT CCTGCTGAAC GTGACAGGGT 3240 

TAGATAGTTA AGACCCACAT TATTAAGGAA GGTCAAACGA TCCTTGATTT CCTTGAGAAT 3 3 00 

GGGACGAGCA ATGATGGCTT CATTTTCAGA CAAAGTTAAC TGGCTCACCA AGTCCAAGTG 33 60 

GTC AG C GAT A GACAGGTCTG AGATTTCTCC AATATGTGGC CCTTGCTGGC CGCCCACACG 342 0 

GACAGACAAG GCCTGGTCAT TGAGACGATA G C CTTG AC AG GTTCCGCAGG TCAGCTCATT 34 8 0 

CATGTAGAGA CGCATCTGAG TGCGAGTGTA ATCGCTATTG GTTTCATGGT AACGACGTTT 354 0 

GATATTATTG ATAACTCCCT CAAACGGAAT GTCGATATCG CGCACGCCAC CAAATTCATT 3 600 

CTCATAGTGG AAATGGAATT CCTTACCATC TGACCCATAG AGAATCAAGT TCTTATCTTC 3 660 

TTCTGACAGG TCCTCAAAAG GCTTATCCAT AGCCACTCCA AAGACTTTCA TGGCCTGCTC 372 0 

TAACATGTTT GGATAGTAGT TGGATGAGAT AGGATTCCAA GGTGCTAGCG CTCCCTCACG 3780 

TAAGGTTTTG CTAGCATCTG GCACTACCAA ATCAGTATCC ACCTCCAGCT TGATGCCCAA 3 840 

GCCGTCACAC T C ACT AC AAG AGCCAAAAGG AGCATTGAAA GAAAAGAGAC GAGGCTCTAA 3 900 

CTCTGGGACA GTAAAACCAC AAACTGGACA GGCATAATGC TCAGAGAACA ACAACTCCGA 3960 

GTCGTCCATG GTGTCGATAA TGACATAACC TTCTGCAATA CGAAGGGCAG CCTCAATGGA 402 0 

AT C AAAGAG A CGACTACGAA TGCCCTCCTT GATAACAATA CGGTCAACCA CGACATCGAT 4 08 0 

ATTGTGTTGC TTGCTCTTAG ACAACTCTGG CACTTCGGTC ACATCATAGA CTTCCCCATC 414 0 

CACACGGACA CGAACATACC CGTCTTTCTG AACCTTCTCG AT AAC AC T CT TATGTTGGCC 42 00 

TTTTTTCTTG CGGATGACAG GAGCCAAGAT CTGCAAGCGC TGGCGTTCAG GTAACTCCAA 4260 

AACCTTATCA ACGATTTGCT C C AC AGAAG A AGCATTGATA GCTCCATGTC C GTTG AT AC A 432 0 

GTAAGGCGTC CCCACACGTG CGTAGAGGAG ACGCAGATAG TC AT TG AT TT CAGTCGTCGT 4 380 

TCCCACCGTC GAGCGAGGAT TTTTACTAGT CGTTTTCTGG TCGATGGAAA TAGCTGGGCT 4440 

GAG AC CAT C A ATGGCATCTA CATCTGGTTT TTCCATATTT CCCAAGAACT GACGAGCGTA 4500 

GGCGGACAAA CTCTCTACAT AGCGACGTTG TCCCTCCGCA TAGAGAGTAT CAAAAGCCAG 4560 

ACTGGACTTC CCTGAACCTG ACAAGCCAGT CACGACAACC AACTTGTCTC GCGGAATCTC 4620 

CACATCAATA TTTTTTAAAT TATGGGCACG CGCCCCATGA ATGACAATTT TATCTTGCAT 4 68 0 

CTTTGTTCTT TCTAGTCCAT TATTGCTTAC CATTATACCA AAAAAAGTGA GATTCTATTA 474 0 

CCCAAAAGGC CGATTTTGTA GTATAATAGT ACAGTGTGAA AAAATCTGAA AAAT GAG AAA 4800 

GGATAAGGGA TATGAAACAA GTTTTTCTCT CTACAACAAC TGAATTTAAA GAGATCGATA 4860 

CGCTTGAACC GGGTACTTGG ATCAATCTCG TCAATCCGAC TCAAAATGAA T C AC TC G AAA 4920 
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TCGCCAACAC CTTCGATATT GATATTGCTG ACCTTCGAGC ACCGCTCGAT GCGGAAGAAA 4980 

TGTCTCGTAT TACCATTGAA GACGAGTATA CCCTGATTAT CGTAGACGTG CCGGTCACGG 5040 

AGGAAAGAAA TAACCGCACC TACTACGTAA CCATCCCGCT TGGTATTATC ATCACTGAGG 5100 

AAACCATTAT CACTACGTGT TTGGAACCAC TACCTGTCCT TGATGTCTTT ATCAACCGTC 5160 

GATTGCGTAA TTTCTATACC TTCATGCGTT CACGTTTTAT CTTTCAAATT CTTTATCGCA 5220 

ATGCAGAGCT TTACCTAACA GCCCTTCGTT CAATCGACCG CAAGAGTGAA CAAATCGAAA 52 80 

GTCAACTGCA TCAATCAACT CGTAATGAAG AATTGATTGA GCTCATGGAA TTGGAAAAAA 534 0 

CTATCGTCTA TTTCAAGGCC TCCCTCAAAA CAAATGAGCG CGTGATTAAG AAATTGACCA 54 00 

GTTCAACCAG CAATATCAAG AAATACCTTG AGGACGAAGA CCTGCTTGAA GACACCCTGA 5460 

TTGAAACCCA ACAGGCCATC GAGATGGCAG ATATTTATGG AAACGTCTTG CATTCTATGA 5520 

CAGAGACCTT TGCCTCTATC ATTTCTAACA ACCAGAACAA C AT C ATG AAA ACCTTGGCCC 5580 

TTGTGACCAT CGTCATGTCC ATCCCAACCA TGGTCTTTTC TGCCTACGGG ATGAACTTTA 5 640 

AGGATAATGA AATCCCCCTA AACGGAGAGC CAAATGCCTT CTGGTTAATC GTCTTTATCG 57 00 

CCTTTGCTAT GAGTGTCTCG CTCACTCTCT ATCTCATCCA TAAAAAATGG TTCTAAGAGG 57 60 

AGTTCCTATG TCTCAAATTG AT CT ACAAAA AT t AACTAAG AAAAACCAAG AGTTTGTCCA 5 820 

CATTGCTACC CAACAATTCA TCAAAGATGG GAAAACAGAC GCTGAAATCC AGACTATTTT 5 8 80 

TGAGGAAGTC ATTCCCCAAA TCCTTGAGGA GCAATCTAAA GGTACAACTG CCCGTTCCCT 5940 

ATACGGCGCA CCAACTCATT GGGCTCATAG CTTCACTGTC AAAGAGCAGT ACGAAAAAGA 6000 

GCATCCAAAA GAAAATGAT G ACCCAAAACT GATG AT TAT G G ACT C AG CT C TTTTCATCAC 6060 

TAGCCTCTTT GCCCTTGTCA GCGCCCTCAC AACCTTCTTT GCGGCAGACC AAGCTTTCGG 612 0 

CTATGGATTG ATTACTCTTC TATTAGTTGG ACTGGTTGGT GGATTTGCCT TCTACTTGAT 6180 

GTACTACTTT GTTTACCAAT ACTATGGACC AGATATGGAT CGCAGTCAAC GTCCACCTTT 624 0 

CTGGAAATCT GTACTAGTTA TCCTAGCTTC TATGTTCCTT TGGTTGCTTG TCTTCTTTGC 6300 

AACAAGCTTC CTACCAGCTA GCCTTAACCC AGTACTGGAT CCATTGCCAC TAGCTATTAT 6360 

TGGAGCAGCC CTCCTAGCCC TTCGCTTCTA TCTCAAGAAA CGCTTGAATA TCCGTAGTGC 6420 

AAGTGCAGGA CCAACACGCT ATCAAGAATA AGAAAACGAT AAAAGCAACT GCAGGTGCGG 648 0 

TTGCTTTTTC ACTTACTTTT TTGAGTTATA T TC AATG AAA ATCAAAGAGC AAACTAGGAA 6540 

GCTAGCTGCA GGTTGCTCAA AGCACAGCTT TGAGGTTGCA GATAAAACTG ACGTGGTTTG 6600 

AAGAGATTTT CGAAGAGTAT TAAAAGT AT T CTTCTGAAAT CCCACATAGC TTTCTCTTAT 66 60 
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ATTTTGTGAT AAAATAGGCT CAATCTATTT CTAGGAGGAT GAGATATGGT TTCTACTATT 6720 

GGTATTGTTA GTTTATCTAG TGGCATTATC GGAGAGGATT TTGTCAAACA CGAAGTGGAC 67 80 

TTGGGTATCC AACGTCTCAA GGATCTGGGA CTCAATCCCA TCTTTTT 6827 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CTGGCTAGTT GCATAGAGCA AAGTTGCTTC TTCATCAACA AAACCGTTCA TTTCAAAATA 60 

GGAAAGCAGC TCATCAGGAC TCTCCAAACG AATCCCTTTG TAATCCAGCT CAACTGCCAC 12 0 

CTCTTTCAAG GCTGCAAGAA GAAGTGTTCC CAGGCCCTGT CTCTGATGGT CAAACTCGAT 180 

GACTAAAGAA TGTACTTTTA GACATTGCGG ATTGTCTGAC TGGGGACTTG ATAAAATATA 24 0 

GCCTAAAAGT TGATTTTCAT CCCTAGCTAG AAGAAAGGTA TCCGCACACT TACGGATACT 3 00 

TTCTTCTAAA ATATGGGAAA GTTGCTGCTT TTCAGCTGGA AAAGACGAGG TCTGAAGTGC 3 60 

CCCTATCTCA GGCAAATCAG ACTTGCTTGC CTGAATGATC TTAATTGGAA TTTCCATGGG 42 0 

AACATCCTAT TGAACATTGC TTGTCAAGTT AGACAAGAGA CGCTCAAATG AGTATTCATA 4 80 

GGTTTGGATG TCTCCTGCTC CCATAAAGAC GTAAACAGCA TTGTCATGGT CTAGGAGTGG 540 

AGAAACATTT TCAACAGTAA TCACTTGGTG TTTTTTGTTG ATTTTGTTGG CTAGGTCTTC 600 

TACCTTAACG TCACCATGAT CTACTTCACG AGCCGAGCCA TAAATTTGCG CTAGATAAAC 660 

AGCATCTGCT TGGTTTAAAG CATGGGCAAA GTCGTCCAAC AAGGCAATGG TTCTTGTAAA 72 0 

GGTATGCGGT TGAAAGACTG CTACAATTTC CTTGCTTGGG TATTTCTGAC GAGCCGCATC 7 80 

CAAGGTCGCA ATAATTTCTG TTGGATGGTG GGCAAAGTCA TCGATAATCA CTGTATCATT 84 0 

GACAATTTTC TCAGTGAAAC GACGTTTAAC ACCGGCAAAT GTTTTCAAGT GCTCACGCAC 900 

C AAGTT C AAA TCAAATCCTG CTGTGTAAAG AAGACCAATA ACGGCTGTCG CATTCATGAT 960 

ATTGTGACGA CCAAAGGTTG GAATGTGGAA TTGCCCCAAG TTTTGTCCAC GGAAATGAAC 102 0 

GGTGAAGGTT GAACCAGTTA TTGAACGAAG AAG AT C ACT A GCTACAAAGT CATTGCCTTC 1080 

AGCTTCAAAA CCATAATAAT AAATTGGTGC ATCAGACGTA ATCTTACGCA AT T CAG CATC 114 0 

TTCACCATAG ACAAAAAGAC CCTTGGTGAT TTGTTTGGCA TAGTCGTTAA AGG C ATT AAA 12 00 

AACATCCTCG AG AC T TGTG A AATAATCTGG ATGGTCAAAG TCAATGTTGG TGATAATAGA 12 6 0 
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GTATTCTGGG TGGTAAGGCA TGAAGTGACG CTCATATTCG TC AG ATT C AA AGACAAAATA 1320 

TTTGGCATTG GCCGAACCAC GACCTGTCCC ATCTCCAATC AAGAAGCTGG TATCTGTAAT 13 8 0 

GTGAGACAAG ACATGAGACA ACATACCTGT CGTTGAAGTT TTTCCATGTG CTCCTGCTAC 1440 

TCCCATGCTA ACAAAGTCAC GCATAAAGCT ACCTAGAAAC TCATGGTAAC GTTTGTAGCT 1500 

GATACCATTT TGGTCCGCAT AGGCAATTTC GACGTTGTTA TCTGGACGAA AGGCATTTCC 1560 

AGCGATAATT TCCATATCAC CGTCTAGATT TTTTTCATCA AAAGGAAGAA TGGTAATTCC 162 0 

TGCCTGCTCA AGACCGCGTT GGGTAAAGTA GTACTTTTCA AC AT C TG AT C CCTGAACCTT 1680 

GTGCCCCATC TGGTGCAACA TCAAGGCCAA GGCACTCATC CCTGATCCCT TAATTCCGAT 1740 

AAAATGATAT GTCTTTGACA TGTTTTCTCC CCTATTCTGT CATTCTGGTC AGATTCAACT 1800 

CTTGGGCAAC CCGACGTTCT TGTTCTGTTT GTTTACTTTT TTTATTGTAG ATTTGGCTCT 1860 

T CT TT AG AAA ATCATAATTG TTTTTCTTTG GAGCAGGTGC TGACACTTCT TCATTCTTGG 192 0 

TAGGGATAGA ATGAACTTCT TCCGCCAAGA TATAATGAGA CTGGGTCAAT TTTTGGCTAT 19 80 

ATTTGACAAA TTCACCAGGA TTTTCCTTTT GGAAAGGAGC TGTCGGTTGA TTGCCCTGTC 2040 

T AACTAG AC T GGGCTGAGAA TGACGTCTCG C AAGG CTGAA ATCCTGAGTT AGGTAGTTAG 2100 

CAGAGCGTTT CTTTTTCAAG TCCGCACGCG CTTCTTCACG CGCCACCTCC GCATAGCTCT 2160 

TTCCTTCTTT TTTAACCCCT AAAGGAGCCT TTTTAGGTTT TTCGACTTGC TTTTCAATCG 222 0 

GTTTTACTGG TTTTTCTTCA GCAATAGGAG CCCATTCTAA ATAATTTTTA TCTCGATACT 2280 

CACCCTTGAT ATTACTGATC AGATCAGACT CATCATAGAG ATTCATGACT GGCATTTCAG 2340 

TCAACATGAC CTCGTCATCT GACACCAATG GAAATCGTTC TTGTTTCATT TTCTATTTCC 2400 

TTTCAACACT TCATTATAGC GTATTGTCTT GATTTTTCAA GTGCTGGCTT CAGAAATTCC 24 60 

CAAAATTTCT CTAATTTCTG CTAGGGTCAG ACTACCACGT GACTCTGTGC CGTCCAATAC 2 520 

TTGTGACACC AGATGTTTCT TTTGTTCTTG GAGTTCCTGA ATTTTTTCTT CAATGGTTCC 2 580 

CTTGGTCACC AAGCGATAGA CCTCAACCGT TTCTTCCTGA CCCATCCGAT GGGCACGGCC 2640 

AATGGCTTGC GCTTCCACCG CAGGATTCCA CCAAAGGTCA ACCAAGATCA CTGTATCTGC 2700 

ACCTGTCAGG TTCAGACCGA CCCCACCAGC CTTGAGGGAA ATCAGAAAGG CATCTCTTTC 27 60 

TCCTTGGTTA AAGGCCTTGG TCATGTCTTG TCTTTCCTTG GCTGGGGTTG AACCCGTAAT 2820 

TTTAAAGGAA GTCAGGCCCA AGTCTGGCAG TTCTTGTTCA ATTTTTTCCA ACATTCCCTT 2880 

GAACTGAGAG AAAATCAAGA CACGGTGTCC GCCGTCTGCC ACCTGTACCA GTAGGTCTCG 2 940 

GAGACTATCT AGTTTGCCGC TGGCTCCCTG ATAATCTTCC ATAAACAGGG CAGGAGTGTC 3000 
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ACATATTTGA CGCAAGCGCA TCAAACCAGA TAAAATTTCC ACACGACTTC GCTGAAATTC 3 060 

CTGTTCTGAC ACTTGAGCCA GATGGTCTCG CATCTGTTGT AACTGGGCAA GGTAAATAGC 312 0 

CTTTTGCTGG TCTTCCAGTT CAT T T TT AT A AACCACCTCA ATCAAGTCTG GCAATTCAGT 3180 

CAGAACTTCT TCTTTCTTGC GTCGCATCAC GAAAGGCTTG ATAAACTGAG CCACTCGCTC 3240 

TGCTGGCAAT TT C AT AAATT CTTTCTTGCT TGGCAAAAGT CCAGGCATGA CGATTTGGAA 33 00 

AATAGACCAC AACTCACCCA GATGGTTTTC AATCGGAGTT CCTGACAAGG CAAAGACCGA 3 3 60 

CGGCACCACA AATTGTCTCA AGGTCTGGGC AATCTTGGTC TGGGCATTTT TCATGACCTG 3 42 0 

AGCCTCATCT AAGAAAAGGA AGTCAAAGGC CATCCCTTGA TAAAACTCAC TGTCCTGACG 3480 

GAAGGTGGCA TAGCTAGTCA CATAGATTTG ATGGCTCTCG GCAAGAATCT CCTCACGACT 3 54 0 

TGCTTTCAAA CCATGAACAA C AG T C AC AT C CAACTGTGGA GCAAATTTCT GAAACTCATC 3 600 

TGCCCAGTTG TAAATCAAAC CCGACGGAGC GAGAATCAAA ACCCGACTTT CTTTTGTCAC 3 660 

TTGACTAGTC AAAAAAGCAA TGGTCTGAAG GGTTTTCCCA AGT C C CAT AT CATCAGCCAA 3 720 

AATCCCACCA AAACCATAAT GATGGAGCAT CTGCAACCAG CCAATTCCCT TTTCCTGATA 3780 

ATCTCGCAAG TCAGCCTTGA CCTGAGTTGC TTGCAAAGGA AAGTCCTCTG GATGCGTCAA 3 84 0 

ATCCTGGGCC AGATTCTGGA ATTCTTGTGA AAAAGAAACA CGGTCTCGCC CTTCAAAGAG 3 9 00 

ATGAGCTAAA CTGTAGGCCA AGGATTTCCG AGCCTGCAAG GTCCCATCTT TTAATTCAAA 3 9 60 

TTGCCCCAGT T C CTGT AG AT TTTGGCGAAT TTTCTTGGTT TCTTCATCGA AAAAGTAAAC 4 02 0 

TTGATTAGAC GAATCAATAT AAAAATCCTG ATTGGCAACC AAGGCCTGCA TGGCTTGGTC 4080 

GATTTCCTCC TGGACAATAT TTTGAAAATC AAACTGGATT TCCAAGAGAC CTCCCTTGGA 4140 

GGCAATCTGC ACCTGAGGAC TCGCTAGGCT ATAAAGCTCT TCTAGTTTAT CTGATAGGTC 42 00 

AACATGCCCG AGTTTTTCAA AGACTGGAAT GAT AT C ATG A AAAAAATGAT AGACAGACTC 42 6 0 

CGCTTTTAAG GCCTGACGCC AAGATTGAAA ATCGGCCTCA AAGCCCGCAG CCAAACAGAC 432 0 

TTGGAAAATT CTTTCTTCTA AGTCTGCGTC ACTTGAAAAG GGTAATTCTT CTAGCTCTTG 43 80 

TCGGCTAGAT ACCTGTCTAT TTCCATAATC AAACTGAATT TCTAAACGAA TCCGATTATC 4440 

TTCTTCCCTG TCAAAGTAAA AAGAGGGCGC AAAAGTTTTG ATTTGTAGAC GTTCTGGAGC 4500 

TGAAACGGTG CCCATCTGGA TAAAAAGAGT CAGACAGGAG GCCAATTTGT CTCGATCACT 4560 

GCTATCAAAT TGCAGGTATT TCTTTCCTTG TTGACCCACA GGTAACGCTT TAATTTCCTT 4 62 0 

GAGAAGACGC ATCTGCTGGT CTGTTAAAAA ATAAACCTGA CCTTTATGGA AAAGTACTGC 4 680 

TCCCTGATAA AAGACATTGA CCCTAGGACT CTCACTGATT TCCATTTCAA AATAATCCGA 474 0 

GTATTCTGTT ACTGTAAAGG CAAATAGATT GGCATCAGCA TGCATATCCT GAAAAAGCAG 48 00 
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GGTTTGGTAG CTATCCACTT GATGGTCAAA TTGAAAATGG GGCAAGGCCA TCAGTAAATT 4860 

CACACCCTGC TCAAAAAAGG TCAGAGGGAA AAAGAGGTGC CGACCTTGGT TTTGGAAAAA 4 92 0 

GAGGTCTGGA ACCAGCCCTT CCTCCGTTAG TCCGTGCAAG AAAGTCAAAA GTTCTTGGCT 4980 

GGCATCATCA AAGGCTTCCC AAGAAAGAGA CTCCTCATAA ATCTTGCCAA T CAT AT AC G A 504 0 

CTTTCTCTGC TCGACAATCC TTAAAAAAAG TGGAATATCC CGAATGACAT AGTATTTTTG 5100 

GCTATTGATT TGGCCGATTC TCAGAGTCCA C AAG AT AT G A TTGGTTCCTG CTTCCACCTG 5160 

ACCCACAGCT GATAACTCAT AGGCGCATTC TGATTTTGGA G AT AAAATT C GATCCAAAAA 5220 

CTTGCCACCC AAGGTCACCT TGGTTTCAAC AGCCTCTTTT TCTTCATGAC CTTCTTCCAG 5280 

ACTCCACAAG ATTTCCTGAC CACGCTCATC ATTTTTCAGA AAATGCTCTA GCGCTGCCAA 53 4 0 

ATGCACACAG TAGCCCCTCT TTTGAAAAAA ATCACAGGCA CAAAAAACCA AATCATCCTC 54 00 

TAAACTATAG CGCAGTTCTT CTTCTGCAAC GCGAGCGTAG AGCCGATTGT TCTTTTCCTT 54 60 

G ATG AT AT C A AC C TT AC C AG TTTCATAAAG GGCAACACCT TCGATACGAA TTTTCCCCGG 552 0 

AATCAATTTA GCCATATTTT CACCTTTACC TTATCTTTTT ATTATACCAT ATTTTCGCCT 5 58 0 

ATGAAAATAG CCTTCTAGGA AGACTTTTCT CCTAGAAGGC TGGATTTTTA ACGTTTGGCA 564 0 

AAAGTAGCCA CAATCCGCTG ACAGACTTCT TGCAACAGAG ATTTGGGCAT AGCTATATTG 5700 

ATGCGGGCAT GGAGACTTCC TTCCTCTCCA AAATCCAAAC CACGGTTGAG GATAACCTTG 57 6 0 

GCTTCATTTC TCAACAACTC TTGCAATGTT TCATCAGTCA GGTCATAAGC TGAAAAGTCA 5 82 0 

AGCCAAATCA AGTAGGTACC TTGCGGTTTC ATGACCTTGA TTTTAGTCTC TTTTCCAAAT 5 880 

AG AT C CAT C A CATAATTGAT GTGGTCTTCA AAGACTTGCT TGAGTTCCTC TAGCCAATCT 5940 

TTACCGTATC GATAGGCAGC TTCTGTCGCC AAATAACCCA AGCCTGAAAT TTCATGCTGA 6000 

TTATTGGCCA ACAGGCGTTT CTGGAAAGCC AGTCTCAACT TAGGATTTTC AATGACTGCA 606 0 

TAGGAATTTT TTGTTCCAGC AATATTAAAT GTTTTAGTGG CACTGCTCAA GACGATAGCA 612 0 

AAATTTTTGA AGGCAGGATT GATGGTATTG AAAGACTGGT GTTTGTGACC AAAGAGGGTC 6180 

AAATCTTGGT GAATCTCATC CGAAACTAAC AAAACACCGT GTTTTTGGCA GAGTTGGCCA 624 0 

ATCTTCTCCA ACACTTCTTT TTCCCAAACA CGTCCACCAG GATTGTGAGG GTTGCAAAGA 6300 

ACATAGAGTT TAACCTCCTC TTCCACCAAA TCCTTTTCAA GTTGGTCAAA GTCAATCTCA 63 60 

AACAGACTAT CCTTTTCCAC TAAGGAATTA GTAATCAATC TACGATTATT CAACTTGACA 6420 

CTGCGAGCAA AGGGTGGGTA G AC AGGCGT G TTAATTAAAA CCGCCTCGCC TTCTTTTGTA 64 80 

AAGGTTTGAA TAGCTGTTGA GATGGCTGGT ACCACACCCT CGATAAAGAC AAGAGCCTCT 6540 
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TTGTCAAAGT TGTAACCGTA TTGTGTAGCT TCCCACTTTT GAACTTCCTT AATTAAGTCT 6600 

TCACTGGCAT AGGTATAACC ATAAACCAGT TGGTCTGCGT AAGTTTGCAC GGCTTGGCGG 6660 

ATTTCAGGCA AG AC C AC AAA GTCCATATCC GCTATCCAAG CTGGTAGAAC TTCACTATCC 672 0 

GTTTCTGTTT CTTTCCATTT ATAGGTATGG TGCCCTAAAC GGTTGGGCAG GCTTGTAAAA 67 8 0 

TCATATTTTC CCATCTTTGT CTTATCCTTC TATGGCTTGG CGCAAATCTG CAATCAAATC 684 0 

TCTAGCATCC TCAATCCCAA TAGACAAACG CAAGAGGTCA TCTGTCAAAC CATAAGAATG 6 9 00 

GCGTACCTCT GCTGGAATAT CAGCATGAGT TTGAGTCGTT GGATAAGTAA TAAGACTTTC 6960 

CACTCCACCC AAACTTTCCG CAAAAGAGAA GACCTTGAGA CTGTTCAAAA TATGAGGAAT 7 02 0 

GCGTGTTTCA TCGGCTACTT TAAAGGAAAT CATGCCTCCA CGACCAGTGT AG AG AACTT C 7 080 

CTTAACTGCT GGAGAATCCT TCAAAAAGGC AACCACTTCT TGGGCGTTAG CTGTTGAGCG 714 0 

CTCCATACGA AGAGACAAGG TCTTGAGACC ACGAAGCAAC TGGTAGCTGT CAAATGGAGA 7 2 00 

CAAGACTGCC CCTGTTGTAT TAAGATTGTA AAAAAGCTTC TCGTATAGTT CTAAACTATT 72 60 

GGTCACAACC ACTCCAGCCA AGACATCATT GTGGCCTGCT AGATACTTGG TTGCTGAATG 73 2 0 

GAGAACGATA TCTGCTCCAT CTTCAATCGG ACGTTGGTAG ATAGGGCTAT AGAAGGTATT 7 3 80 

GTCCACCACC ACTTTGGCAC CCTTAGCATG AGCCAATTTT GCTAGTTTTT CGATATCAAA 744 0 

TTCCAACATC AAGGGATTGG TTGGGGTTTC GATATAGAGA ACATCCACAT CCTTTTCTAA 7 500 

CTCGGCAATC AACTCTTCTT CTGTATTGGC ATAGGTAAAA TGGAAATGAC CTTCCTGCTC 7 5 60 

CACTTGGTTA AACCAGCGAA AAGAACCACC GTAAAGATCA CGCACTGCCA AGACCTTACT 7 62 0 

TCCTACTGGA AAGACGCTAA AGGCCAGTAC AATAGCTGAC ATCCCTGAGC TAGTCGCTAG 7 680 

GGCATAGTCT GCTGACTCAA TAGCCGCCAA GACTTCCTCA GCCTTACTAC GAGTTGGATT 774 0 

TTTAGTGCGC GTATAGTCAA ACCCAGTAGA TCGACCAAAC TCTGGATGCT GATAGGTCGT 7 800 

TGAAAAATGA AGTGGTGTCA CCAAAGCACC TGTTGCCTCA TCAGACTTGA TCCCTGCTTG 7 8 60 

TGCTAAAATT GTGTTAATGT GTAATTCCTT GCTCATACAA TTCCTCCAAA TCTATAGTAA 7 92 0 

CTATTGTACC ACT TAT T T TG TATCCTTCGT TTTCTTGTTT TCAAGAGCTA GTTATAGTTT 79 80 

CAAACTATAT AAAAAGGGAG TTTTTCCTGC TCCCTTTAAT AGACTATAAA ATGGTGAATC 8040 

TCAAAAGACA CCTTCACTCT ATCATTTGCT CCTGCACAAA ACGAGCATAA CGCTCATGAT 8100 

TTTCCAGTAG TTCCTTATGA GTTCCTGAGC CAGTGATTTT CCCCTCCTCT AAGAAGAAAA 8160 

TACAATCCAC AT CTTTT AC C GTTGACAAAC GATGCGCTAT AATCACAACC GTCTTCTCCT 8220 

TTAGTACAGA ATAGAGGCTA CTGATAATCG CATACTCAGA ATCCGCATCA AGATTAGCAG 8280 

TGGCTTCATC AAATATAAGA ATTTCAGCAT CTTTT AAGTA GGCTCTAGCT ATTTGAAGTC 8340 
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TTTCGTTCGC 


CCCCCTGACA 


AGAGTCGTCC 


GCGTTCACCA 


ACTTCAGTAT 


CTAGTCCCTC 


8400 


TTTCATGGAG 


CGAATCTCAT 


CACCTAGTGA 


TACTAAGTCT 


AGCACTTTCA 


TC AATT CAT C 


8460 


ATCAGTTACT 


AAGCGATTCA 


AACCGAGACA 


AAGATTGTCA 


CGAATACTGC 


CAGATAAGAC 


8520 


TGCATTATTT 


TGTGAAACCC 


AAGCGATTTT 


ACTTCTCCAT 


TCTTTTAAGT 


TAAAATCATA 


8580 


TATACTTGAT 


TG C T C C ATT A 


GAATATCTCC 


TGAAAGCGGT 


TTATAAAACC 


GCTCTAACAA 


8640 


ACGCACAATC 


GTTGATTTTC 


CTGATCCAGA 


TGGTCCAACA 


AAAGCAATTT 


TTTGCCCCTT 


8700 


GAAAATTGAA 


CAAGTAATAT 


CCTTTAAGAC 


AGGTCGATTT 


TCATCATAAC 


CAAAATAGAC 


8760 


ATGGTTAAAA 


TTCAACCCTC 


GTCCTGATAC 


CGATTTTCCT 


CCCTCAAATT 


TTTCTTTAGG 


8820 


AACTGCAAGC 


AAGTTCTCCA 


GTGCAACTGA 


AGATCCCTTG 


CTCCTAGAAT 


AAACAGTTAC 


8880 


AAAATTAGCT 


AT ATT ACT AA 


TAGGATTAAG 


TAATTGAAAG 


AGGTAAATCA 


AAAACGAAAC 


8940 


CAAGGTTCCC 


ACAGATATAT 


ATCCTGCGCT 


GACCCGATAA 


CCCCCATAGG 


TTAGCATCAC 


9000 


AGCTATAGTC 


GCAAAGATAA 


ATAAGAGAGC 


AAACGGGGTC 


TCAAAAGAAG 


TAACCCTATC 


9060 


TGATTTCAGT 


GAATTGTTTT 


GTACCCTTTC 


AATACAATTA 


TCCAAAACAT 


CCTGTACACT 


9120 


TTTCTCTGCT 


TGGTTAGTCT 


TAATTAATTC 


ATGTTCTTGA 


ATCTTTTCAG 


TCAATTGCCC 


9180 


TGTTAAATTT 


CCTCCTGTAA 


ACGACGACTA 


TACTTTTCAC 


TGATATTGGA 


AAGGGGCAAG 


9240 


ATAATAAACA 


TCATACAAGG 


AAGAGTGATG 


AATAAAAGTA 


GAGAAAGATT 


CCAATCAAGA 


9300 


CTAAATAAGA 


CTACAATGGA 


ACCAAGTACC 


ATAACTAAAC 


TCAGAATAAT 


ATTTGGGAAA 


9360 


GTCGTAATTA 


AAAACTCACG 


AATGACACTC 


GTGTCATTGA 


CAATGGCAGA 


AGTCAACTCC 


9420 


CCACTTTGGC 


T C TTAT C AAA 


GAAGGATTTC 


T CT AC AT AAA 


TCAACCCCTC 


TAT C ACT T T T 


9480 


TTCCTGATTT 


TTGCTATCTT 


TTTTTCACCC 


G AT TG AC T AA 


ACAGATAGTA 


ACCAATAGAA 


9540 


GAAAACAAGG 


CTTGACCAAT 


AAAAATCAAA 


AACGATTGAA 


ATACTTTGGA 


GCCTATATTT 


9600 


TCAATAGAAC 


TCCCATCTAT 


TAAATCCTTT 


AAGATAAGGG 


GAAGCAACAA 


AGCAAGTAGA 


9660 


CTAGACAGAA 


CAAGTAAGAA 


ACTCCCCATA 


ATCACCTTAG 


TATCTACTCT 


TAATAATTTT 


9720 


AATTTCATAA 


ATACTCCTTA 


TAATATTTCA 


ACGGATAAAG 


TCGGGAATAA 


CTCAATTTGA 


9780 


GGATAAAATC 


TAATAAATCT 


TCCTATAACA 


AAACGCATAA 


CATCTAGGAT 


T T TAT AT AC C 


9840 


TG AT ATT AT G 


CGTTTTTAAG 


CACAAAGACT 


TCTTACACAA 


ACTTATCTAC 


AATTAGATTT 


9900 


TATTTGACAT 


GTTTTGCCAA 


TTCTTCTTGG 


GCTTTTTTAT 


TGGATTCTTC 


TTTTTCTTTC 


9960 


AACCATTTTT 


CTCTGGCTTT 


TGCATATTCG 


TCTGTTGTGA 


CAATCTTATC 


TTGTACTTTG 


10020 


AGGTATTTAT 


ATGATTCAAC 


CCCTTTTGTA 


CCGGTTAAAC 


CATAGGCAGC 


AGCAAATGGT 


10080 
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ACGGTTCTTC TCAATGATGG TGTTCCCCCA CGCGAAACAC TTGGAAGAAC TAAAGAACTA 1014 0 

TCAATCAACC AAGCTTGAAT ATCAGCATAT TTCTCATAAC GTTTGGCCGG ATCTTGCTCT 10200 

TTATTAGCTT CTTCCAACAT TTGAGTATAG ACATCCAGTC CAACTGCCTT AGCCTTGTCA 10260 

TTGGCCTCAC CAGGCTCTAG TCCAAGATTT TGCAGAAATC CTCCACTATT AGTATTAAAA 1032 0 

ATATCGAGAT AGGTTGACGG GTCTTGATAA TCAGGTCCCC AACCGCCATG AT AT AAAT C A 103 8 0 

TAATCTTTCT GAGCAGCTGT TTGAGCAAAG TAGCCTGAAC TGTCAAACTC ATCTGATGTT 10440 

AATTGCTGAA TGTCAATCAC TACATTATCA GAACCTAAAA CAGATTCAAT TGATTGTTTG 10500 

ATAGAACTAA CTCCTTGTAT GCCTACTTTA TCTGTTACTT CCACAGTCTT ATCCAAGTGG 105 60 

ATTGGGAATT GAACACCCTT TGCTTCGAGT TCTTTCTTAG CTTCCGCAAA CTTAGCCTTG 10 620 

GCTTTCTCAG GATTGTAGTA AGGGTCTTGA CCATCCGCAA AGTTGATACC TTGCCATTCC 10 680 

TTACCATAGT TGACCATCTT AGAGGCTACA ACTTCACCAA AGTCTTTTCC CTTGATACTG 10740 

ACAAAGTTTG GAGGAACCAC TAGGTTACGC AAAATCTTTG TTGCACCTTC TTTCCCTTCA 10 800 

GACTGAGCCC CATAAGATGT TCTGTCAAAA GCAAAATTGA TAG C CT G AC G GAAGTTTTTA 10 860 

TTGAGAACTG CTTCCTGAGT CGATTTCTTT TCAATGTCAC TTGTTTTAGA AGTATAATTG 10 92 0 

TAAGACTTCC TATCTAGGTT AAAATTAAAG AAATATGAAG TTGAATTTTG CAT ACT AT AG 10 980 

ATGATATTGT TTTTGTATTT TTCTTTAATC CCTTCATAGC TGGAGCTGTT AGGAAAAAGA 11040 

CGAGCCGTAG T AT AAGC AC C AGCTGTAAAA TTACGTTCCA GTGATTCTTG GTCGCTACCA 11100 

TCATAGTAGG TCAATTTCAC ATCGTCTACA AAGACATTCT TAGCATCCCA GTAATTAGGG 11160 

TTTTTCTTAT ATTCAATAGC AGATTTTGAG ACAAGTGCTT TCATCAAGAA AGGTCCATTG 1122 0 

TACAAAATAC T AG ATGG AT C CGCCTTCCCA AAATCATCCC CTTTTGATTT CAGGAAATCT 112 80 

GCATTAACAG GAAAAAGTAT CGTTGCAAGT GTTTTTGAAT TCCAGTAAAG TTCTGGTTTA 11340 

ACCAAAGTAT ATTGAACCGT TTGGTCATCA AGTGCCTTGA CACCGACAGT TGAAAAGTCG 114 00 

CTTGTTTTAC CAGTGATATA GTC AT C C AAA CCAGCAACAG AGTCCTGCAC TAGATACAAG 114 60 

GCTTCTGATT TTTTATCAGC TGCATATTGC AAACCTGTCA CAAAATCCTG GGCAGTTACA 11520 

GGCGCATATT CTTCTCCCTC AGAAGTAAAC C ACT TGG CAT CCTTACGAAG TTTGTAGGTA 11580 

TAGGTCAAAC CGTCCTGAGA AACAGTCCAA TCCTCTGCTA ATGATGGAAT AATATTCCCA 11640 

TATTGGTCAT TTTCTAATAA CCCGTCTACC AAATTTGCAA CAATATCGGA TGTTGCTGCG 117 00 

CGGTTTTCTG CTAGATAGTT CAAGCTAGAT GGATCACTTG AATAAACATA GTTGTAGGTT 117 60 

TTTGACGCCG TGCTAGAATT TCCACACGCG CTCAATAAAA CTCCTGTACC CAGGACAAGA 11820 

CCTGCCAAGG TTAGATATTT GCTCTTAGAC TTTTTCATTT CCGG 118 64 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

T AACTG C ACT AAACATAATA TAAGGAGAGA AAATGTCTGC AATAGAACGT AT T AC AAAAG 6 0 

CTGCTCACTT AATTGATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 12 0 

TTGCTGAGGA AGTCACTTTC CCCCTATCTG ACCAGGAAAT CATCCTAGGC GAAAAGATGA 180 

TGCAATTCCT TAAACATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 24 0 

GTGTTGGACT GGCTGCTCCC C AGT T AG AT A TCTCAAAACG CATTATCGCT GTTTTGGTAC 3 00 

CTAATATTGT TGAAGAAGGC GAAACTCCAC AGGAAGCCTA CGATTTGGAA GCCATTATGT 3 60 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AAGATGCTGC TCTTGGCGAA GGAGAAGGTT 42 0 

GCCTGTCTGT TGACCGTAAC GTGCCTGGCT ATGTTGTTCG CCATGCCCGC GTTACTGTTG 4 80 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGGCTAC AACTCCATTG 54 0 

TTGTTCAGCA TGAAATTGAC C AC ATT AAC G GTATCATGTT TTACGATCGC AT C AATG AAA 600 

AAGACCCATT TGCAGTTAAA GATGGTTTAC TGATTCTTGA ATAAAGAAAA TCCCGTTGCA 6 60 

AGACGGGGTT TTGTGTTATA ATAGAGGCAT GAAAACAAAT GATATTGTCT ATGGTGTCCA 72 0 

CGCCGTTACC GAAGCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 780 

CCGAGGTAAG AATGTTGAGA AAGTCAAGGA ACTAGCTACA GAAAAGAAGG TGTCCATTTC 84 0 

TTGGACATCA AAAAAATCTC TCTCTGAGAT TACTGAAGGT GCTGTTCATC AAGGTTTTGT 900 

TCTACGAGTG TCTGAATTTG CCTATAGCGA G CT AG ATT AC ATCCTTGCAA AAACACGCCA 9 60 

AGAAGAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGGGTTC 1020 

TATCTTGCGA ACAGCCGATG CGACCAATGT TTCAGGTGTC ATCATTCCCA AGCACCGTAC 108 0 

TGTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACAGGT GCTATTGAAC ACGTtCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGATAAACT TAAGGATGAA GGTTTCTGGA 12 00 

CCTTTGGAAC GGATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 12 60 

CCCTCATCAT TGGAAATGAA GGAAAAGGTA TCTCTAGCAA CATCAAAAAA CAGGTCGATG 132 0 

AAATGATTAC C ATTCCG AT G AATGG AC AT G TTCAAAGCCT T AATG CC AGT GTTGCTGCGG 13 80 
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C C ATT CT CAT GTACGAAGTT TTCCGAAATA G AC T AT AAAA AAGTTTCCAG TCATCTGATT 1440 

GGAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT ATAGGCTTCT TGACCAGCGA 15 00 

TAGCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC TTTCAAGCGA ACATCTCCAA 15 60 

CTGCAAAGAT ACCGTCGACT GCAGTTTTCA TGTGGTTATC TGTCACAATC CATCCTGCCT 162 0 

GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG GTCCAAACCA ACATAGATAA 168 0 

AGACACCACC GAAGGCTTGT TCTGTCACTT GACCTGTTTT CACATTTTCA AATACGACTG 174 0 

ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA ATCCCAGATA AAGCTGATTT 1800 

TTTCATTCGC AAAGGCGCGA TCTTGTAAAA CCTTTTGGGC ACGAAGTTGG TCACGACGGT 1860 

GAACAATGGT AACAGTCTTA GCAAAACGAG TCAAGAAGAG GGCTTCTTCA ACAGCTGAAT 192 0 

CTCCACCACC AACTACCAAT AAATCTTGGT CACGGAAGAA AG C AC CATC A CACACAGCAC 19 80 

AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG CACTCCCAAA GGACGGTGTT 2040 

TAGAACCAGT TGCTACGATA AC T GT ACGTG TTTCATATGT TTGGTCATCA GT C AT C ACT T 2100 

TCTTAAAATC ACCATGGCTT CGACATTTTC AACATAACCA TAAATGTGCT CAACACCAAG 2160 

ATTTTCAAGT GGTTCAAACA TCTTTTCAGC CAATTCAGGT CCACTAATAT TAGCGTATCC 2220 

TGGGTAATTT T CG AT AT C AG ATGTATTATT CATCTGACCA CCTGGCAGAC CACCTTCAAT 22 80 

CAAAGCTACT TTTAGATTGC TTCGAGCAGC ATACAAGGCC GCAGTCATCC cTGCAGGTCC 234 0 

AGCACCGATA ATAATAGTAT CGTACATATA GATTCCTTCT TTCTTGGTGT AACTATCTTT 2 400 

ATTCTAACTC TG 2412 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGAGCA GAGTTTACCT 60 

TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAATATTGTA GATGTAAACA 120 

AGGATGAATG GATTTTAGGA ACAGTTGCTG GTGCCAATAC CTTATTGGTT ATTTGTCGAG 180 

ATCAGCACGT TGCCAAACTC ATGGAAGATC GTTTGCTAGA TTTGATGAAA GATAAGTAAG 240 

GTCTTGGGAG TTGCTCTCAA GACTTATTTT TGAAAAGGAG AGACAGAAAA TGGCGATAGA 300 

AAAGTT AT C A CCCGGCATGC AACAGTATGT GGATATTAAA AAGCAATATC CAGATGCTTT 3 60 
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TTTGCTCTTT CGGATGGGTG ATTTTTATGA ATTATTTTAT GAGGATGCGG TCAATGCTGC 42 0 

GCAGATTCTG GAAATTTCCT TAACGAGTCG CAACAAGAAT GCCGACAATC CGATCCCTAT 480 

GGCGGGTGTT CCCTATCATT CTGCCCAACA G TAT AT CG AT GTCTTGATTG AGCAGGGTTA 54 0 

TAAGGTGGCT ATCGCAGAGC AGATGGAAGA TCCTAAACAA GCAGTTGGGG TTGTTAAACG 600 

AGAGGTTGTT CAGGTCATTA CGCCAGGGAC AGTGGTCGAT AGCAGTAAGC CGGACAGTCA 660 

GAATAATTTT TTGGTTTCCA TAGACCGCGA AGGCAATCAA TTTGGCCTAG CTTATATGGA 72 0 

TTTGGTGACG GGTGACTTTT ATGTGACAGG TCTTTTGGAT TTCACGCTGG TTTGTGGGGA 7 80 

AATCCGTAAC CTCAAGGCTC GAGAAGTGGT GTTGGGTTAT GACTTGTCTG AGGAAGAAGA 84 0 

ACAAATCCTC AGCCGCCAGA TGAATCTGGT ACTCTCTTAT GAAAAAGAAA GCTTTGAAGA 9 00 

CCTTCATTTA TTGGATTTGC GATTGGCAAC GGTGGAGCAA AC GGC AT CT A GTAAGCTGCT 960 

CCAGT AT GTT C AT CGG ACT C AGATGAGGGA ATTGAACCAC CTCAAACCTG TTATCCGCTA 102 0 

CGAAATTAAG GATTTCTTGC AGATGGATTA TGCGACCAAG GCTAGTCTGG ATTTGGTTGA 108 0 

GAATGCTCGC TCAGGTAAGA AACAAGGCAG TCTTTTCTGG CTTTTGGATG AAACCAAAAC 1140 

GGC T ATGGGG ATGCGTCTCT TGCGTTCTTG GATTCATCGC CCCTTGATTG ATAAGGAACG 12 00 

AATCGTCCAA CGTCAAGAAG TAGTGCAGGT CTTTCTCGAC CATTTCTTTG AGCGTAGTGA 1260 

CTTGACAGAC AGTCTCAAGG GTGTTTATGA CATTGAGCGC TTGGCTAGTC GTGTTTCTTT 1320 

TGGCAAAACC AATCCAAAGG ATCTCTTGCA GTTGGCGACT ACCTTGTCTA GTGTGCCACG 13 80 

GATTCGTGCG ATTTTAGAAG GGATGGAGCA ACCTACTCTA GCCTATCTCA TCGCACAACT 144 0 

GGATGCAATC CCTGAGTTGG AGAGTTTGAT TAGCGCAGCG ATTGCTCCTG AAGCTCCTCA 1500 

TGTGATTACA GATGGGGGAA TTATCCGGAC TGGATTTGAT GAGACTTTAG ACAAGTATCG 15 60 

TTGCGTTCTC AGAGAAGGGA CTAGCTGGAT TGCTGAGATT GAGGCTAAGG AG CGAGAAAA 162 0 

CTCTGGTATC AGCACGCTCA AGATTGACTA CAATAAAAAG GATGGCTACT ATTTTCATGT 1680 

GACCAATTCG CAACTAGGAA ATGTGCCAGC TCACTTTTTC CGCAAGGCGA CGCTGAAAAA 1740 

CTCAGAACGC TTTGGAACCG AAGAATT AG C CCGTATCGAG GG AG AT ATGC TTGAGGCGCG 1800 

TGAGAAGTCA GCCAACCTCG AATACGAAAT AT T T ATGC GC ATTCGTGAAG AGGTCGGCAA 1860 

GTACATCCAG CGTTTACAAG CTCTAGCCCA AGGAATTGCG ACGGTTGATG TCTTACAGAG 192 0 

TCTGGCGGTT GTGGCTGAAA CCCAGCATTT GATTCGACCT GAGTTTGGTG ACGATTCACA 1980 

AATTGATATC CGGAAAGGGC GCCATGCTGT CGTTGAAAAG GTTATGGGGG C T C AG AC C T A 2 04 0 

TATTCCAAAT ACGATTCAGA TGGCAGAAGA TACCAGTATT CAACTGGTTA CAGGGCCAAA 2100 
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CATGAGTGGG 


AAGTCTACCT 


ATATGCGTCA 


GTTAGCCATG 


ACGGCGGTTA 


TGGCCCAGCT 


2160 


GGGTTCCTAT 


GTTCCTGCTG 


AAAGCGCCCA 


TTTACCGATT 


TTTGATGCGA 


TTTTTACCCG 


2220 


TATCGGAGCA 


GCAGATGACT 


TGGTTTCGGG 


TCAGTCAACC 


TTTATGGTGG 


AG ATG AT GG A 


2280 


GGCCAATAAT 


GCCATTTCGC 


ATGCGACCAA 


GAACTCTCTC 


ATTCTCTTTG 


ATGAATTGGG 


2340 


ACGTGGAACT 


GCAACTTATG 


ACGGGATGGC 


TCTTGCTCAG 


TC CAT CAT CG 


AAT AT AT CCA 


2400 


TGAGCACATC 


GG AG CTAAG A 


CCCTCTTTGC 


GACCCACTAC 


CATGAGTTGA 


CTAGTCTGGA 


2460 


GTCTAGTTTA 


CAACACTTGG 


TCAATGTCCA 


CGTGGCAACT 


TTGGAGCAGG 


ATGGGCAGGT 


2520 


CACCTTCCTT 


CACAAGATTG 


AACCGGGACC 


AGCTGATAAA 


TCtACGGTAT 


CCATGTTGCC 


2580 


AAGATTGCTG 


GCTTGCCAGC 


AGACCTTTTA 


GCAAGGGCGG 


ATAAGATTTT 


GACTCAGCTA 


2640 


GAGAATCAAG 


GAACAGAGAG 


TCCTCCTCCC 


ATGAGACAAA 


CTAGTGCTGT 


CACTGAACAG 


2700 


ATTTCACTCT 


TTGATAGGGC 


AGAAGAGCAT 


CCTATCCTAG 


CAGAATTAGC 


TAAACTGGAT 


2760 


GTGTATAATA 


TGACACCTAT 


GCAGGTTATG 


AATGTCTTAG 


T AG AGT T AAA 


ACAGAAACTA 


2820 


TAAAACCAAG 


ACTCACTAGT 


TAATCTAGCT 


GTATCAAGGA 


GACTTCTTTG 


ACAATTCTCC 


2880 


ACTTTTTTGC 


TAGAATAACA 


TCACACAAAC 


AGAATGAAAA 


GGAGCTGACG 


CATTGTCGCT 


2940 


CCCTTTTGTC 


TATTTTTTAA 


GGAGAAAGTA 


TGCTGATTCA 


GAAAATAAAA 


ACCTACAAGT 


3000 


GGCAGGCCCT 


GGCTTCGCTC 


CTGATGACAG 


GCTTGATGGT 


TGCTAGTTCA 


CTTCTGCAAC 


3060 


CGCGTTATCT 


GCAGGAAGTC 


TTAGGCGCCC 


TCCTTACTGG 


GAAATATGAA 


GCTATTTATA 


3120 


GTATCGGGGC 


TTGGTTGATT 


GGTGTGGCCG 


TAGTCGGTCT 


AGTTGCTGGT 


GGACTCAATG 


3180 


TTGTCCTCGC 


AGCCTATATT 


GCCCAAGGAG 


TTTCATCCGA 


CCTTCGGGAG 


GATGCCTTCC 


3240 


GTAAAATTCA 


AACCTTTTCT 


TATGCTGATA 


TTGAACAATT 


TAATGCGGGA 


AATCTAGTCG 


3300 


TTCGAATGAC 


AAATGATATC 


AAC C AG ATT C 


AGAACGTTGT 


CATGATGACC 


TTCCAAATTC 


3360 


TT T T C AG AC T 


TCCCCTCTTG 


TTCATCGGTT 


CGTTTATCCT 


AGCGGTTCAA 


ACCTTACCTT 


3420 


CTCTGTGGTG 


GGTGATTGTT 


CTCATGGTAG 


TCTTGATTTT 


TGGTTTGACT 


GCTGTCATGA 


3480 


TGGGAATGAT 


GGGGCCTCGT 


TTTGCCAAGT 


TTCAAACCCT 


TCTTGAGCGC 


ATCAATGCCA 


3540 


TTGCCAAGGA 


AAATTTACGT 


GGCGTTCGTG 


TGGTCAAGTC 


CTTTGTCCAA 


G AAAAAG AG C 


3600 


AATTTGCTAA 


GTTTACAGAG 


GTCTCAGACG 


AGCTTCTTGG 


TCAAAACCTT 


TACATTGGTT 


3660 


ATGCCTTTTC 


AGTAGTGGAA 


CCCTTTATGA 


TGTTGGTTGG 


TTACGGGGCG 


GTCTTCCTCT 


3720 


CTATTTGGCT 


GGTCGCGGGA 


ATGGTTCAGT 


CGGATCCGTC 


TGTTGTTGGT 


TCCATCGCTT 


3780 


CTTTTGTTAA 


TTACCTAAGC 


C AG ATT AT C T 


TTACCATTGT 


TATGGTTGGA 


TTTTTGGGAA 


3840 


ATTCTGTCAG 


CCGTGCCATG 


ATTTCCATGC 


GTCGTATTCG 


AGAAATTCTT 


GACGCAGAGC 


3900 
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CAGCTATGAC CTTCAAGGAT ATCCCAGATG AAGAGTTGGT TGGAAGTCTT AGCTTTGAAA 3 9 60 

ATGTGACCTT TACCTATCCA ATGGACAAGG AACCGATGCT GAAAGATGTG AGCTTTACTA 402 0 

TTGAACCTGG TCAAATGGTT GGTGTAGTTG GAGCGACTGG TGCAGGAAAG TCAACCTTGG 4080 

CTCAATTGAT TCCACGTCTC TTTGATCCAC AGGACGGGGC CATTAAAATC GGTGGCAAGG 414 0 

ATATTCGAGA AGTGAGTGAA GGAACCCTGC GTAAAACAGT TTCCATCGTT CTCCAACGTG 42 00 

CCATTCTTTT TAGTGGAACG ATTGCAGATA ACTTGAGACA GGGGAAGGGG AATGCTACTC 42 60 

TATTTGAAAT GGAGCGCGCA GCCAATATTG CCCAGGCTAG TGAATTCATT CATCGTATGG 4320 

AGAAAACCTT TGAAAGTCCA GTTGAAGAAC GGGGAACCAA TTTCTCTGGT GGACAAAAAC 43 80 

AAAGGATGTC GATTGCGCGT GGGATTGTCA GCAATCCACG TATTCTGATT TTTGATGATT 444 0 

CGACCTCAGC CTTGGATGCC AAAT C AG AG C GCTTGGTGCA AGAAGCTTTG AATAAGGACT 4 5 00 

TGAAGGGGAC GACAACCATT ATTATTGCTC AAAAAATTAG CTCGGTTGTC CAT GC AG AC A 4 5 60 

AGATCTTGGT TCTAAATCAA GGACGATTGA TTGGTCAAGG TACGCATGCA GACTTGGTTG 4 62 0 

CCAACAATGC CGTTTACCGT GAAATCTATG AAACACAGAA AT G AAAG AC A AACTATAAGA 4 68 0 

AAAGTCAATA GTTTTATCTA AACTATTTCT TATTTCAATT TGATGATTTG GCGATGATTT 4 74 0 

TAGAGCACGG CAAAAAGCCC TTGAAAAAGT CCATTTTTTC AAAGGT AAT C CTGTGTTAAT 4 800 

TTCAGAAATT ACATCACTTT TTGTTCGTCA AATGGCAGCT CTTTTTTTAG GATATAAAAC 4860 

AGGGTTCGGA TAAGTTTTTT TGCAAGGTGG ATGATGGCTA CATTGTAATG TTTTCCTTGT 4 920 

TCTAATTTAG TCTTAAGATA GGCCTTAAAA GCAGGCGAAA AGCGAGGGCA TGCTTTGGCA 4 9 80 

GCTTGTATGA GTACCTACCG CAGATGAGGG GAACTCCGTT TGACCATTCT TCCTGCTAAA 5 040 

TCAATCTGAT CTGACTGATA AATAGAAGAA TCCAGTCCAG CGAAAGCTTG T AATTG AG C A 5100 

GG AT TAT C AA AGGCATGAAT ATTTCGAATC TCAGCTAAAA TGACCGCCCC TAAACGATCC 5160 

CCAATCCCAG TAACCGTCGT GATGACCGAG TTGAACTCAG CCATCAAGTC ATTGACACAT 522 0 

GTTTCCGCCT TGTCAATGAG CCTCTTGTAA TGTTTGATGT TTTCATTACA CGAGATAAAA 5280 

CGTCTATGCG TTATCAAACT CATTACCAAT TAAAACAAAA AGCTGTGGTT AGATCCTTTC 534 0 

GGAAATTGTC AAGCGATTGG AGGAAATGAA CTAATCCACA GCGGCTTATT CCAAGTATAC 5400 

CACTTGGGCT TTGGCAGTAG CTAACTGCGC T AAAT AT AAT ATAAGGAGGA GTAAAATGAA 54 60 

GACAGTTCAA TTTTTTTGGC ATTATTTTAA GGTCTACAAG TTCTCATTTG TAGTTGTCAT 552 0 

CCTGATGATT GTTCTGGCGA CTTTTGCCCA AGCCCTCTTT CCAGTCTTTT CTGGACAAGC 55 80 

GGTGACGCAG CTAGCCAATT TAGTTCAAGC TTATCAAAAT GGCAATCCAG AACTTGTATG 5640 
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GCAAAGCCTA TCAGGAATCA TGGTCAATCT TGGCCTGCTG GTTTTGGTTC TATTTATCTC 57 00 

TAGTGTAATA TACATGTGTC TCATGACGCG CGTGATTGCA GAATCGACCA ACGAGATGCG 57 60 

CAAAGGCCTC TTTGGTAAGC TTGCTCAGTT GACGGTTTCT TTCTTTGACC GTCGACAAGA 582 0 

TGGCGATATC CTGTCTCATT TTACCAGTGA TTTGGATAAT ATCCTCCAAG CCTTTAACGA 5880 

AAGCTTGATT CAGGTCATGA GCAATATTGT TTTATACATT GGTCTGATTC TTGTCATGTT 5 940 

TTCGAGAAAT GTGACGCTGG CTCTCATCAC CATTGCCAGC ACCCCATTGG CTTTCCTTAT 6000 

GCTGATTTTC ATCGTGAAAA TGGCACGCAA ATACACCAAC CTCCAGCAGA AAGAGGTAGG 6060 

GAAGCTCAAC GCCTATATGG ATG AG AG CAT CTCAGGCCAA AAAGCCGTGA TTGTGCAAGG 612 0 

AATTCAAGAG GATATGATGG CAGGATTTCT TGAACAAAAT GAGCGCGTGC GCAAGGCAAC 618 0 

CTTTAAAGGA AG AATGT T CT CAGGAATTCT TTTCCCTGTC ATGAATGGGA TGAGCCTGAT 6240 

TAATACAGCC ATCGTCATCT TTGCTGGTTC GGCTGTACTT TTGAATGATA AGTCTATTGA 6300 

AACAAGTACA GCCCTAGGTT TGATTGTTAT GTTTGCACAA TTTTCACAGC AGTACTACCA 63 6 0 

G C C T ATT AT C CAAGTTGCAG CGAGTTGGGG AAGCCTTCAG TTGGCCTTTA CTGGAGCTGA 642 0 

ACGAATTCAG GAAATGTTTG ATGCAGAGGA GGAAATCCGA CCTGAAAAGG CTCCAACCTT 6480 

CACTAAGTTG CAAGAAAGTG TTGAAATCAG TCATATCGTT TTTTCATACT TGCCTGATAA 654 0 

ACCTATTTTG AAAGATGTCA GCATTTCTGC CCCTAAAGGC CAGATGACAG CAGTTGTTGG 6600 

GCCGACAGGT TCAGGAAAAA CGACTATTAT GAACCTCATC AATCGCTTTT ATGATGTTGA 66 60 

TGCTGGTGGT ATTTATTTTG ATGGTAAAGA CATTCGTGGC TATGACTTAG ATAGTCTTAG 672 0 

AAGCAAGGTG GGAATTGTAT TGCAAGATTC GGTCTTGTTT AG CGG AACG A TTAGAGACAA 67 8 0 

TATCCGATTT GGTGTGCCAG ATGCTAGTCA GGAAATGGTT GAGGTAGCAG CAAAAGCAAC 6840 

CCACATTCAC GACTATATCG AAAGTTTGCC TGATAAGTAC GATACTCTTA TTGATGATGA 6 900 

C C AG AG CAT C TTTTCAACAG GGCAGAAGCA ATTGATTTCA ATCGCTCGAA CCCTGATGAC 6960 

AGATCCAGAA GTTCTCATTC TCGATGAAGC AACTTCAAAC GTAGATACGG TGACAGAAAG 7 02 0 

CAAGATTCAG CATGCCATGG AGGTGGTTGT AGCAGGTAGA ACTAGTTTCG TCATTGCCCA 7 080 

CCGCTTGAAA ACCATTCTCA ATGCAGATCA GATTATTGTC CTTAAAGATG GAGAAGTCAT 7140 

TGAACGTGGT AAC C AC CAT G AACTTTTGAA GCTAGGTGGC TTTTATTCAG AACTCTATCA 72 00 

CAATCAATTT GTTTTCGAAT AAGAAAGAAG TTGTCCTATG TGGGCAGCTT TTTCTTGTCC 72 6 0 

ATAAAAAATG TTTATCACAG CCTTAAAAAA AACATATTAG ACGAAAGTCA TTTTGAGTGA 7320 

TATGATAGGA CTATCGTTAG CATTCGAAAG GAGAGGCATC ATGGCTAGAA CGGTTGTAGG 73 80 

AGTTGCTGCA AATCTATGTC CCGTAGACGC AGAAGGCAAA ATCATTCATT CATCTGTATC 744 0 
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TTGTAGATTC GCAGAGATCA TTCGTCAAGT CGGTGGTCTC CCTTTAGTCA TTCCTGTTGG 



7500 



TGATGAGTCA GTTGTACGTG ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 



7560 



CCAAAATGTT CATCCTCAGT TTTATGGAGA G AAAAAG AC C GT C GAG AG CG ATGATTACAA 



7620 



TCTGGTCCGT GACGAATTTG AATTGGCACT CTTGAAGGAA GCGCTTCGTC AGAATAAACC 



7680 



AATTATGGCA ATCTGTCGCG GTGTCCAACT TGTCAATGTT GCCTTTGGTG GAACCCTCAA 



7740 



TCAAGAAATC GAAGGTCAGG 



7760 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GAGGTTTTAA TTCACTTACC TCTsCCGTAT CTTTATTTAA AATGAATTCT TTTACGGTTG 6 0 

TATTTCTTGC AAAATCTTTT ACAACAATCT TAATGTTTAG TGTCTTGTCT ATTATTTGTT 12 0 

TAATATCATT AAATGATGTA TATTCTTTTC CATTTATATA AATATGTTGT TCTTGAATCT 180 

CACCATCGAA T C CAT T ATTT CTTTTATCAT TGATGTTAAA GACTACAGAT TTTCCATCAG 24 0 

CATATTCGAT ACTAGTATTT CCCTTAGGAT CAATGTTTAC TTCGGGTTTA ACATTATCAT 3 00 

ATAAAAACTG AT AGTGG AC T CCAACTGCTT TAG C ATT C AA ATCGCTATAG CCAGTTTGAA 3 60 

GATAAACATT TCCATCCATA TCTGTTACCT TATCTGGAAA TCCGTTTGCT TTATAGTCTT 42 0 

TCATTCCCCA GTCCATGATG TCACCGTCTT TAACATTCAG CTTAATATTA AAATCTCTAG 480 

TGTTATCAAT GTGTAAATCT CCGTAGATTA AATAATTATC TACAACCGAT TCATTAACTC 540 

TCAATTCCCA GTTAAAACCA CCCTTATCAG AAATCTTACC TCTTAAATAA AATTCTGGAT 600 

TTCGTACATA AATTTTATTA GATTTAGATG GATTAAAGTA GTTCTTATCC ATTGAAAGGT 660 

TTACTGGTTT GGTATCAATA AATAACATGG AGCCATCTTC TTTTATAGCT TCTACATTGA 72 0 

ACTTATCCTC TCCAGTGTAT TCTTTATCAT CCTTACCAAA TAATACAAGT TTAGAAGAAT 780 

CTGTCACAAG ATTTCCGTCT T TAT CG AT AG CTTCCCCTTT ATCGTTCATT TTAAATGTAA 840 

ACACTTGATA CCTTATAATG TTAAAGCCGT CCAAAGCCGA CATTAATACA GATTGGGTAC 9 00 

TTCTTCCATC TTCAACATTT CTACTATCAG CATAAATTGT TGTTTCTGAA AGGGCTCTTA 960 

GATTAGGATT GGCCTTTTGT ATTTTTGCTA TATCTTCCTT G C TAT AG AC T CCATTTCCTT 1020 
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CTAACATATC CGTTTTTCCA GGATTATAGG TAGTCACTTT TAGTGCATAG CCTTTTCTTA 10 8 0 

GAATGATATT ATCCTTTAAC AGATATTGTT GTTTTTCTGA AT C AG AAT AG ATTTTACCAG 114 0 

ATTCCATTTT AGTTAAATTG TCTGGTTTGT TTTTTGAAAG ATCTCCTTCC CCTAATTCTA 1200 

TGACATTCCC ATAACTTGAT ACATAGGGAT ATTCTGATTT AGTTTCCTTA ATTTTTTCAG 12 60 

GCATTCTAAT TTTAATTTCA GCTTTTTTCT GAT CAT TAT C TTTAACAAAT AAT CTC AT AT 13 2 0 

CTCCTGCAAA AG CT AATCC A TCCACAATAT CATTAATATT AGCGTATAGA TCAAATGTCA 13 80 

TCGTTTTTGA GTGGAAATCA TACTTGGTCG CTTTGATTTC TATAGATTTA TAGTTATTCC 144 0 

CATAATATAC CTTGGCATTT TTAGAAACAT TACTTATCTT TCCAAGAATT TCAAAGTGTC 1500 

CATCTTTAGA CGGACTTAGA ACACCATAAA TTTTTGATTT GATTTCGTCA AGTTTCTCAG 1560 

TTTCATATTC TAGATCAGTC CCATCATCGT AGGCTATTAT ATTTCCTTTA TCATCGTATT 162 0 

TATAATCGTA TTCCTCCATT CTCTTACCAG TTTCACTTGT AAAATCATCA ACTTCTCTAA 1680 

ATTTCTTTTT AATGAGTTTC TTTAAGTCTT TATTTTCAAA GTCTCTAATT GTTGAAATAT 174 0 

TTCTATCAAT AGTAAAACTA GATTTTTCTT T AAT AG ACT C TTCATTTTCT TGATGATGAT 18 00 

GTTCTACCCC AGTTGTATCT TTTTTTAGAC TACCCTCTTT TCCATTTCCT AAATTTTTAA 18 6 0 

ATTTAGATTC TGCAATCTCG CCAAGCTTTT GATATTTAGA TG AAT C T T G A TCAGGATCTA 19 2 0 

C TAG AT AAT A GG AAAT CAT C CCCTTTTCAT CAGCCTGATT AGCAAATTTA ATTCTATGAA 19 80 

TCTTTGTGAA ATTGCTAGAA C C AT CT AAT G CAATGACTTC AATGATTTTT CCCCTTAAAT 2 04 0 

CTCCCGCACC TTTAATTTCA TAAATGGTAT TTCCGTCTTT ATCAAGTTTT CTATTTCTTC 2100 

CTTGACCCTC ACCTGCGTAA GTTACTTCAA GATTTTTTTC AACCTCTCCA TCTTCATTAA 2160 

CAAGAGCGGC GCCAGCATAC CAAACTTCGT TCGCAATCTC GTCAAATTTT TCAGGATGTT 222 0 

CTTTTTGATC TCTCGCAAAT AGCGTTTCAT TCTTATACTG ATCTTTTACC TTATGATAAG 22 80 

TATCCTTTGT AATCAACTTA ATTTTTTCAG GATTTGAAAA ATCAACCGAA ACAATCTTAG 2 340 

GGGCGGTGTT AT C AATTTTT ACAGGAATAT AGGAAACCTG CCATGGGTAA TCTTTAGTTA 2400 

AT C T AT AT TT AAATTTATAG AAAT AT TG AC CTTCCGCAAT CGGTTCAAAT TGACCTCTTA 2460 

TCTTAGTAGC AGGATCTTGA TTATCCTTAC TTTCTGGTGC ATTTTCTTCT CTACCTCTAG 252 0 

GATTATAGAT GAGTCCATCC CACTTCAAGT CACCCCAAAC TTTTAGTTTA GATGATTTGA 2580 

TTCCCTTTGC ATCATTGCTT TTAGAATTTA AAATTCCTCT AATAAAGTGT TCTCTCGAAA 2 640 

TGACTTTTAA GTCTCTTTGA TTTTCTCCCT CTTTATTTGT ATTTACTATT G AAAT C AAT C 27 00 

CTTCTTCTGC ACTTCTTAAT ACA 2723 
(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



AAAAAAGTGG 


G AATG ACT C A 


AATCTTCACT 


GAAGCTGGCG 


AATTGATCCC 


TGTAACAGTT 


60 


ATTGAAGCAA 


CTCCAAACGT 


TGTTCTTCAA 


GTTAAAACTG 


TTGAAACAGA 


CGGATACAAC 


120 


GCTATCCAAG 


TTGGTTTCGA 


TGACAAACGC 


GAAGTATTGA 


GCAACAAACC 


TGCTAAAGGA 


180 


CATGTAGCGA 


AAGCTAACAC 


GGCTCCTAAG 


CGCTTCATTC 


GTGAATTCAA 


AAACGTTGAA 


240 


GGCTTGGAAG 


TTGGTGCTGA 


AATTACAGTT 


GAAACATTCG 


CAGCTGGAGA 


CGTTGTTGAC 


300 


GTAACGGGTA 


CTTCTAAAGG 


TAAAGGTTTC 


CAAGGTGTTA 


TCAAACGCCA 


CGGACAATCA 


360 


CGTGGACCAA 


TGGCTCACGG 


TTCTCGTTAC 


CACCGTCGTC 


CAGGTTCTAT 


GGGGCCTGTT 


420 


GCACCTAACC 


GCGTATTCAA 


AGGTAAAAAC 


CTTGCAGGAC 


GTATGGGTGG 


CGACCGCGTA 


480 


ACAATTCAAA 


ACCTTGAAGT 


TGTACAAGTT 


GTTCCAGAAA 


AG AAC GTT AT 


CCTTATCAAA 


540 


GGTAACGTAC 


CAGGTGCTAA 


GAAATCTCTT 


ATCACTATCA 


AATCAGCAGT 


TAAAGCTGGT 


600 


AAATAATAAA 


GAAAGGGGAA 


ATCAGTCACA 


ATGGCAAACG 


TAACATTATT 


TGACCAAACT 


660 


GGTAAAGAAG 


CTGGCCAAGT 


TGTTCTTAGC 


GATGCAGTAT 


TTGGTATCGA 


ACCAAATGAA 


720 


TCAGTTGTGT 


TTGATGTAAT 


CATCAGCCAA 


CGCGCAAGCC 


TTCGTCAAGG 


AACACACGCT 


780 


GTTAAAAACC 


GCTCTGCAGT 


ATCAGGTGGT 


GGACGCAAAC 


CATGGCGTCA 


AAAAGGAACT 


840 


GGACGTGCTC 


GTCAAGGTTC 


TATCCGCTCA 


CCACAATGGC 


GTGGTGGTGG 


TGTTGTCTTC 


900 


GGACCAACTC 


CACGTTCATA 


CGGCTACAAA 


CTTCCACAAA 


AAGTTCGTCG 


CCTAGCTCTT 


960 


AAATCAGTTT 


ACTCTGAAAA 


AGTTGCTGAA 


AACAAATTCG 


TAGCTGTAGA 


CGCTCTTTCA 


1020 


TTTACAGCTC 


CAAAAACTGC 


TGAATTTGCA 


AAAGTTCTTG 


CAGCATTGAG 


CATCGATTCT 


1080 


AAAGTTCTTG 


TTATCCTTGA 


AGAAGGAAAT 


GAATTCGCAG 


CTCTTTCAGC 


TCGTAACCTT 


1140 


CCAAACGTGA 


AAGTTGCAAC 


TGCTACAACT 


GCAAGTGTTC 


TTGACATCGC 


AAAT AG CG AC 


1200 


AAACTTCTTG 


T C AC AC AAGC 


AGCTATCTCT 


AAAATCGAGG 


AGGTTCTTGC 


ATAATGAATT 


1260 


TGTATGATGT 


TATCAAAAAA 


CCTGTCATCA 


CTGAAAGCTC 


AATGGCTCAA 


CTTGAAGCAG 


1320 


GAAAATATGT 


ATTTGAAGTT 


G AC ACT CGT G 


CACACAAACT 


TTTGATCAAG 


CAAGCTGTTG 


1380 


AAGCTGCTTT 


CGAAGGTGTT 


AAAGTTGCCA 


AT GTT AAC AC 


AATCAACGTA 


AAACCAAAAG 


1440 
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CTAAACGTGT TGGACGTTAC ACTGGTTTTA CTAACAAAAC TAAAAAAGCT AT CATC AC AC 15 00 

TTACAGCTGA TTCTAAAGCA ATCGAGTTGT TTGCTGCTGA AGCTGAATAA TCTAAGGAGG 15 6 0 

AAATATCGTG GGAATTCGTG TTTATAAACC AACAACAAAC GGTCGCCGTA ATATGACTTC 1620 

TTTGGATTTC GCTGAAATCA CAACAAGCAC TCCTGAAAAA TCATTGCTTG TTGCATTGAA 1680 

GAGCAAGGCT GGTCGTAACA ACAACGGTCG TATCACAGTT CGTCACCAAG GTGGTGGACA 174 0 

CAAACGTTTC TACCGTTTGG TTGACTTCAA ACGTAATAAA GACAACGTTG AAGCAGTTGT 1800 

TAAAACAATC GAGTACGATC CAAACCGTTC TGCAAACATC GCTCTTGTAC ACTACACTGA 18 60 

CGGTGTGAAA G CAT AC AT C A TCGCTCCAAA AGGTCTTGAA GTAGGTCAAC GTATCGTTTC 192 0 

AGGTCCAGAA G C AG AT AT C A AAGT CGG AAA CGCTCTTCCA CTTGCTAACA TCCCAGTTGG 19 80 

TACTTTGATT CACAACATCG AGTTGAAACC AGGTCGTGGT GGTGAATTGG TACGTGCTGC 2 040 

TGGTGCATCT GCTCAAGTAT TGGGTTCTGA AGGTAAATAT GTTCTTGTTC GTCTTCAATC 2100 

AGGTGAAGTT CGTATGATTC TTGGAACTTG CCGTGCTACA GTTGGTGTTG TCGGAAACGA 2160 

AC AAC AT GG A CTTGTAAACC TTGGTAAAGC AGGACGTAGC CGTTGGAAAG GTATCCGCCC 222 0 

AACAGTTCGT GGTTCTGTAA TG AAC C CT AA CGATCACCCA CACGGTGGTG GTGAAGGTAA 22 80 

AGCACCAGTT GGTCGTAAAG CACCATCTAC TCCATGGGGC AAACCTGCTC TTGGTCTTAA 2340 

AACTCGTAAC AAGAAAGCGA AATCTGACAA AC T T AT CGTT CGTCGTCGCA ACGAGAAATA 2400 

ATATTAAACT AGTCGCTTAA GCAACTAGTA AATCCGCCAG CTCGGTAGCG C TC CAT AG G A 24 60 

GTGCAAGCCG CTGTGGTACA ACATTTAAAG GAGAAAATAT AAAAATGGGA CGCAGTCTTA 252 0 

AAAAAGG AC C TTTCGTCGAT GAGCATTTGA TGAAAAAAGT TGAAGCTCAA GCTAACGACG 2 5 80 

AAAAGAAAAA AGTTATTAAA ACTTGGTCAC GTCGTTCAAC GATCTTCCCA AGTTTCATTG 2 64 0 

GTTACACTAT TGCAGTTTAT GACGGACGTA AACACGTACC TGTTTACATC CAAGAAGACA 2700 

TGGTAGGCCA CAAACTTGGT GAATTTGCAC CAACTCGTAC TTACAAAGGT CACGCTGCAG 2 7 60 

ACGACAAGAA AAC AC GT AG A AAATAAGGAG AACATAAATG GCAGAAATTA CTTCAGCTAA 2 82 0 

AGCAATGGCT CGTACAGTAC GTGTTTCACC TCGTAAATCA CGTCTTGTTC TTGATAACAT 2 880 

CCGTGGTAAA AGCGTAGCCG ATGCAATCGC AATCTTGACA TTCACTCCAA ACAAAGCTGC 2940 

TG AAAT CAT C TTGAAAGTTT TGAACTCAGC TGTAGCTAAC GCTGAAAACA ACTTTGGTTT 3000 

GGATAAAGCT AACTTGGTAG TATCTGAAGC ATTCGCAAAC GAAGGACCAA CTATGAAACG 3060 

TTTCCGTCCA CGTGCGAAAG GTTCAGCTTC ACCAATCAAC AAACGTACAG CTCACATCAC 312 0 

TGTAGCTGTT G C AG AAAAAT AAGGAGGTAA AATCGTGGGT CAAAAAGTAC ATCCAATTGG 3180 

TATGCGTGTC GGCATCATCC GTGATTGGGA TGCCAAATGG TATGCTGAAA AAGAATACGC 3240 
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GGATTACCTT CATGAAGATC TTGCAATCCG TAAATTCGTT CAAAAAGAAC TTGCTGACGC 33 00 

AGCAGTTTCA ACT AT TG AAA TCGAACGCGC AGTAAACAAA GTTAACGTTT CACTTCACAC 3 3 60 

TGCTAAACCA GGTATGGTTA TCGGTAAAGG TGGTGCTAAC GTTGATGCaC TCCGTGCAAA 3 42 0 

ACTTAACAAA TTGACTGGAA AACAAGTACA CATCAACATC ATCGAAATCA AACAACCTGA 34 80 

TTTGGATGCT CACCTTGTAG GTGAAGGAAT TGCTCGTCAA TTGGAGCAAC GTGTTGCTTT 3 54 0 

CCGTCGTGCA CAAAAACAAG CAATCCAACG TGCAATGCGT GCTGGAGCTA AAGGAATCAA 3 600 

AACTCAAGTA TCAGGTCGTT TGAACGGTGC AGATATCGCC CGTGCTGAAG GATACTCTGA 3 660 

AGGAACTGTT CCGCTTCACA CACTTCGTGC AGATATCGAT TACGCTTGGG AAGAAGCAGA 3720 

TACTACATAC GGTAAACTTG GTGTTAAAGT ATGGATCTAC CGTGGTGAAG TTCTTCCAGC 37 80 

TCGTAAAAAC ACTAAAGGAG GTAAATAACC AATGTTAGTA CCTAAACGTG TTAAACACCG 3 84 0 

TCGTGAGTTC CGTGGAAAAA TGCGCGGTGA AGCAAAAGGT GGAAAAGAAG TAGCATTCGG 3 9 00 

TGAATACGGT CTTCAAGCTA CAACTAGCCA CTGGATCACT AACCGCCAAA TCGAAGCTGC 3 960 

TCGTATCGCC ATGACTCGTT ACATGAAACG TGGTGGTAAA GTTTGGATTA AAATCTTCCC 4 02 0 

AC AC AAAT C A T AC AC TG CT A AAGCT AT CGG TGTGCGTATG GGATCTGGTA AAGGGGCACC 408 0 

TGAAGGTTGG GTAGCACCAG TTAAACGTGG TAAAGTGATG TTCGAAATCG CTGGTGTATC 414 0 

TGAAGAGATT GCACGTGAAG CGCTTCGACT TGCTAGCCAC AAATTGCCAG TTAAATGTAA 42 00 

ATTCGTAAAA CGTGAAGCAG AATAAGGAGA AGGCATGAAA CTTAATGAAG TAAAAGAATT 42 60 

TGTTAAAGAA CTTCGTGGTC TTTCTCAAGA AGAACTCGCG AAGCGCGAAA ACGAATTGAA 4 32 0 

AAAAGAATTG TTTGAACTTC GTTTCCAAGC TGCTACTGGT CAATTGGAAC AAACAGCTCG 43 8 0 

CTTGAAAGAA GTTAAAAAAC AAATCGCTCG CATCAAAACA GTTCAATCTG AAGCGAAATA 444 0 

ATAGACTAGG GAAGGAGAAA TTTCAATGGA ACGCAATAAT CGTAAAGTTC TTGTTGGACG 4500 

TGTTGTATCT GACAAAATGG ACAAGACAAT CACAGTTGTA GTTGAAACAA AACGTAACCA 4 5 60 

CCCAGTCTAT GGTAAACGTA TTAACTACTC TAAAAAATAC AAAGCTCATG ATGAAAACAA 4 620 

TGTTGCCAAA GAAGGCGATA TCGTACGTAT CATGGAAACT CGCCCGCTTT CAGCTACAAA 4680 

ACGTTTCCGT CTTGTAGAAG TTGTTGAAGA AGCGGTCATC ATCTAATCAA ACCTGAAAGG 4740 

AGAAAACTGA AATGATTCAA AC AG AAACT C GTTTGAAAGT CGCAGACAAC AGCGGTGCTC 4 800 

GCGAAATCTT GACT AT C AAA GTTCTTGGTG GTTCAGGACG TAAATTTGCA AACATCGGTG 48 60 

ATGTTATCGT GGCATCTGTA AAACAAGCTA CTCCTGGTGG TGCGGTTAAA AAAGGTGACG 492 0 

TTGTTAAAGC AGTTATCGTT C GT ACT AAAT CAGGTGCTCG TCGTGCTGAT GGT T CAT AC A 4980 
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TCAAATTTGA CGAAAACGCA GCAGTTATCA TCCGTGAAGA CAAAACTCCT CGCGGAACAC 504 0 

GTATCTTTGG CCCAGTTGCA CGTGAATTGC GTGAAGGTGG CTTCATGAAG ATCGTGTCAC 5100 

TTGCTCCAGA AGTACTTTAA TTTTTAGGAA CAAACTAGTC CCCTAGCTTC AAGCTAGGGT 5160 

GCCCTTATGG GCGTAAGAAA AATCAAGGAG AAACCTAATG TTTGTAAAAA AAGGCGACAA 52 2 0 

AGTTCGCGTA ATCGCTGGTA AAGATAAGGG AACAGAAGCT GTTGTCCTTA CTGCCCTTCC 52 80 

AAAAGTAAAC AAAGTTATCG TTGAAGGTGT TAACATTGTT AAGAAAC AC C AACGTCCAAC 53 4 0 

TAACGAGCTT CCTCAAGGTG GTATCATCGA GAAAGAAGCA GCTATCCACG TATCAAACGT 54 0 0 

TCAAGTTTTG GACAAAAATG GTGTAGCTGG TCGTGTTGGA TACAAATTTG TAGACGGTAA 5460 

AAAAGTTCGC TACAACAAAA AATCAGGCGA AGTGCTTGAT TAATCACGAA GGAAAGGAGA 552 0 

AGTATAATGG CAAATCGTTT AAAAGAAAAA TATCTTAATG AAGTAGTTCC TGCTTTGACA 558 0 

GAACAATTCA ACTACTCATC AGTGATGGCT GTGCCTAAAG TAGATAAGAT TGTTTTGAAC 5640 

ATGGGTGTTG GTGAAGCTGT ATCAAACGCT AAAAGCCTTG AAAAAGCTGC TGAAGAATTG 57 00 

GCACTTATCT CAGGTCAAAA AC C ACT TAT C ACTAAAGCTA AAAAATCAAT CGCCGGCTTC 57 6 0 

CGTCTTCGTG AAGGTGTTGC GATCGGTGCA AAAGTTACCC TTCGTGGTGA ACGTATGTAC 582 0 

GAATTCTTGG ATAAATTGGT ATCAGTTTCA CTTCCACGTG TACGTGACTT CCACGGTGTC 5880 

CCAACAAAAT CATTTGATGG ACGCGGGAAC T AC AC AC T T G GTGTGAAAGA ACAATTAATC 5 94 0 

TTCCCAGAAA TCAACTTCGA TGACGTTGAC AAAACTCGTG GTCTTGACAT CGTTATCGTA 6000 

ACAACTGCTA ACACTGACGA AGAGTCACGT GCATTGCTTA CAGGCCTTGG AATGCCTTTT 60 60 

GCAAAATAAT ATAGGAGGTA AATCTAATGG CTAAAAAATC AATGGTAGCT AGAGAGGCTA 612 0 

AACGCCAAAA AATTGTTGAC CGTTATGCTG AAAAACGTGC TGCATTAAAG GCGGCAGGGG 6180 

ACTACGAAGG TTTATCTAAA TTACCTCGCA ACGCCTCACC GACTCGTTTA CATAATCGTT 62 40 

GTAGGGTTAC GGGGCGCCCA CATTCAGTTT ACCGCAAATT TGGTCTGAGT CGTATCGCTT 63 00 

TTCGCGAACT TGCGCATAAA GGTCAAATTC CTGGTGTAAC AAAAGCATCT TGGTAATTTA 6 3 60 

AG AT AT C AAG AGCGTCAAAA CTCCAAGTAA AAATAGGAAA CTTGACGAAG AAACTAAAGT 6420 

TTCTAGGAAA GTTTATCTTT TTCACACAGA GTTTAGCCCG GGTTCAATTG GGCTTGCCAA 64 80 

TTTGAACACG AGCTACAGCT TTGGCAAAAA AGACCAATTT GCTTTGGAGC ATTGCTTCTG 654 0 

CATTAAATTG TCTATTTTTG CTCGTGCTGT TACGCTCTTT GTATCATGTA TTAACTAGCA 6600 

AGTGCAACTT GCAAACTACT AGTAAGAGGA GAAAAACAAA ATGGTTATGA CTGACCCAAT 6660 

CGCAGACTTC CTAACTCGTA TTCGTAATGC TAACCAAGCT AAACACGAAG TACTTGAAGT 672 0 

ACCTGCATCA AACATCAAAA AAGGGATTGC TGAAATCCTT AAACGCGAAG GTTTTGTAAA 6780 
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AAACGTTGAA ATCATTGAAG ATGACAAACA AGGCGTCATC CGTGTATTTC TTAAATACGG 6840 

ACCAAATGGT G AG AAAGT T A TCACTAACTT GAAACGTGTT TCTAAACCAG GACTTCGTGT 6900 

CTACAAAAAA CGTGAAGACC TTCCAAAAGT TCTTAACGGA CTTGGAATTG CCATCCTTTC 69 60 

AACTTCTGAA GGTTTGCTTA CTGATAAAGA AGCACGCCAA AAGAATGTTG GTGGTGAGGT 7 02 0 

TATCGCTTAC GTTTGGTAAA AT C AAG AT AC AAAGCTCGTA AAGAACAAAG CAAAATTAGG 7 080 

AAGTTGGAGA AGTTTGTTTA CAAACAAGCC AACTTATCTA TTTTGCACAG TTCTTAGAGC 7140 

GTGTTCAGTT CAGCTCTTGA ACTAAATAAG TATCTGAACC CCGTGAAAAC TGGCCGTTCT 7200 

GGCCTGACAA TTTAACAGGA GAAAATAAAC ATGTCACGTA TTGGTAATAA AGTTATCGTG 72 60 

TTGCCTGCTG GTGTTGAACT CGCTAACAAT GACAACGTTG TAACTGTAAA AGG AT C T AAA 73 2 0 

GGAGAACTTA CTCGTGAGTT CTCAAAAGAT ATTGAAATCC GTGTGGAAGG TACTGAAATA 73 80 

ACTCTTCACC GTCCAAACGA TTCAAAAGAA ATGAAAACTA TCCACGGAAC TACTCGTGCC 744 0 

CTTTTGAACA ACATGGTTGT TGGTGTATCA GAAGGATTCA AGAAAGAACT TGAAATGCGT 7 500 

GGGGTTGGTT ACCGTGCACA GCTTCAAGGA TCTAAACTTG TTTTGGCTGT TGGTAAATCT 7 5 60 

CATCCAGACG AAGTTGAAGC TCCAGAAGGA ATTACTTTTG AACTTCCAAA CCCAACAACA 7 62 0 

ATCGTTGTTA GCGGAATTTC AAAAGAAGTA GTTGGTCAAA CAGCTGCTTA CGTACGTAGC 7680 

CTTCGTTCAC CAGAACCATA TAAAGGTAAA GGTATCCGTT ACGTTGGTGA ATTCGTTCGC 7 74 0 

CGTAAAGAAG GTAAAACAGG TAAATAATGT TGAGTGGTTG ATCATCAACC ACCAACCTAT 7 80 0 

TTTCCAACTT TGTGCATAGC ACACGATTTA AAACTAAAGA GGTGAAAACT GTGATTTCAA 7 8 60 

AACCAGATAA AAACAAACTC CGCCAAAAAC GCCACCGTCG CGTTCGCGGA AAACTCTCTG 7 92 0 

GAACTGCTGA TCGCCCACGT TTGAACGTAT TCCGTTCTAA T AC AGGC AT C TACGCTCAAG 7 9 80 

TGATTGATGA CGTAGCGGGT GTAACGCTCG CAAGTGCTTC AACTCTTGAT AAAGAAGTTT 8 04 0 

CAAAAGGAAC TAAAACTGAA CAAGCCGTTG CTGTCGGTAA ACTCGTTGCA GAACGTGCAA 8100 

ACGCTAAAGG TATTTCAGAA GTGGTGTTCG ACCGCGGTGG ATATCTATAT CACGGACGTG 8160 

TGAAAGCTTT GGCTGATGCA GCTCGTGAAA ACGGATTGAA ATTCTAATAG GAGGACACTA 82 2 0 

GAAAATGGCA TTTAAAGACA ATGCAGTTGA ATTAGAAGAA CGCGTAGTTG CTGTCAACCG 82 80 

TGTTACAAAA GTTGTTAAAG GTGGACGTCG TCTTCGTTTC GCAGCTCTTG TTGTTGTTGG 8340 

TGACCACAAT GGTCGCGTAG GATTTGGTAC TGGTAAAGCT CAAGAAGTTC CAGAAGCAAT 8400 

CCGTAAAGCA GTAGATGATG CTAAGAAAAA CTTGATCGAA GTTCCTATGG TTGGAACAAC 8460 

AATCCCACAC GAAGTTCTTT CAGAATTCGG TGGAGCTAAA GTATTGTTGA AACCTGCTGT 8 52 0 
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AGAAGGTTCT GGAGTTGCCG CTGGTGGTGC AGTTCGTGCC GTTGTGGAAT TGGCAGGTGT 8580 

GGC AG AT AT T AC AT CT AAAT CACTTGGTTC TAACACTCCA ATCAACATTG TTCGTGCAAC 8640 

TGTTGAAGGT TTGAAACAAT TGAAACGCGC TGAAGAAATT GCTGCCCTTC GTGGTATTTC 8700 

AGTTTCTGAT TTGGCATAAG AAAGGGGATA AAATGGCTCA AATTAAAATT ACTTTGACTA 87 60 

AGTCTCCAAT CGGACGCATT CCATCACAAC GTAAAACTGT TGTAGCACTT GGACTTGGCA 882 0 

AATTGAACAG CTCTGTTATT AAAGAAGATA ACGCTGCTAT CCGTGGTATG AT C AC AGC AG 8880 

TATCTCACTT AGTAACAGTT GAAGAAGTAA ACTAATGAaG TTTTAGGGGA TGTGCACTGT 8940 

ACCATCCCCT AAAACTAGAT ATAGTCATCT ATGATGACAT CGTATAGGCG AGTTGATGGG 9000 

GGAGACAACC TTTTCTCCCT TATCGGCGCT AGCATTTTAC AAAAGAGGAG AAAATAAAAA 9060 

TGAAACTTCA TGAATTGAAA CCTGCAGAAG GTTCTCGTAA AGTACGTAAC CGCGTTGGTC 9120 

GTGGTACTTC ATCAGGTAAC GGTAAAACAT CTGGTCGTGG TCAAAAAGGT CAAAAAGCTC 9180 

GTAGCGGTGG CGGAGTTCGC CTTGGTTTTG AAGGTGGACA AACTCCATTG TTCCGTCGTC 924 0 

TTCCAAAACG TGGATTCACT AACATCAACG CTAAAGAATA CGCAATTGTG AACCTTGACC 9300 

AATTGAACGT CTTTGAAGAT GGTGCTGAAG TAACTCCAGT TGTTCTTATC GAAGCAGGAA 9 3 60 

TTGTTAAAGC TGAAAAGTCA GGTATTAAAA TTCTTGGTAA CGGTGAGTTG ACTAAGAAAT 942 0 

TGACTGTGAA AGC AG CT AAA TTCTCTAAAT CAGCTGAAGA AGCTATCACT GCTAAAGGTG 94 80 

GTTCAGTAGA AGTCATCTAA GAGAGGTGAC CTATGTTTTT TAAATTATTA AGAGAAGCTC 9540 

TTAAAGTCAA GCAGGTTCGA TCAAAAATTT TATTTACAAT TTTTATCGTT TTGGTCTTTC 9 600 

GTATCGGAAC TAGCATTACA GTTCCTGGTG TGAATGCCAA TAGCTTGAAT GCTTTAAGTG 9660 

GAT T ATCCTT CTTAAACATG TTGAGCTTGG TGTCGGGGAA TGCCCTAAAA AACTTTTCGA 972 0 

TTTTTGCCCT AGGAGTTAGT CCCTATATCA CCGCTTCTAT TGTTGTCCAA CTCTTGCAAA 97 80 

TGGATATTTT ACCCAAGTTT GTAGAGTGGG GTAAACAAGG GGAAGTAGGT CGAAGAAAAT 9840 

TGAATCAAGC TACT CGT TAT ATTGCTCTAG TTCTCGCTTT TGTGCAATCT ATCGGGATTA 99 00 

CAGCTGGTTT TAATACCTTG GCTGGAGCTC AATTGATTAA AACTGCTTTA ACTCCACAAG 9960 

TTTTTCTGAC GATTGGTATC ATCTTAACAG CTGGTAGTAT GATTGTCACT TGGTTGGGTG 10020 

AGCAAATTAC AGATAAGGGA T AC GGAAACG GTGTTTCCAT GATTATCTTT GCCGGGATTG 1008 0 

TTTCCTCAAT TCCAGAGATG ATTCAGGGCA TCTATGTGGA CTACTTTGTG AACGTCCCAA 10140 

GT AG C CGT AT CACTTCATCT ATCATTTTCG TAATCATTTT GATTATTACT GTATTGTTGA 102 00 

TTATTTACTT TACAACTTAT GTTCAACAAG CAGAATACAA AATTCCAATC CAATATACTA 10260 

AGGT TGC AC A AGGTGCTCCA TCTAGCTCTT ACCTTCCGTT AAAAGTAAAC CCTGCTGGAG 1032 0 
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TTATCCCTGT TATCTTTGCC AGTTCGATTA CTGCAGCcTG CGGCTATTCT TCAGTTTTTG 103 80 

AGTGCCACAG GTCATGATTG GGCTTGGGTA AGGGTAGCAC AAGAGATGTT GGCAACTACT 10440 

TCTCCAACTG GTATTGCCAT GTATGCTTTG TTGATTATTC TCTTTACATT CTTCTATACG 10 5 00 

TTTGTACAGA TTAATCCTGA AAAAGCAGCA GAGAkCCTAC AAAAGAGTGG TGCCTATATC 10560 

CATGGAGTTC GTCCTGGTAA AGGTACAGAA GAATATATGT CTAAACTTCT TCGTCGTCTT 10 62 0 

GCAACTGTTG GTTCCCTCTT CCTTGGTGTG ATTTCCATTT TACCGATTGC AG CT AAAG AT 10 680 

GTATTTGGTC TTTCTGATGT TGTTGCCTTT GGTGGAACAA GTCTCTTGAT CATTATCTCT 10740 

ACAGGTATCG AAGGAATCAA GCAATTGGAA GGTTACCTAT TGAAACGTAA GTATGTTGGT 10 800 

TTCATGGACA GAACAGAATA AAAGTATTTA CTGAATCAGT AAAT AC TG AG GGAGTGGAGG 10860 

TTTAAACTCT GACATTTGTA AGAGTTGGAT CTCCCCTCTT CTATTTTGTT TTTAAATCGG 10 92 0 

GGTGAAAAGA CTTTTTGCTT CTATTTAAAA ATAAAATAAG GAG AT C AAAT CATGAATCTT 10 9 80 

TTGATTATGG GCTTACCTGG TGCAGGTAAG GGAACTCAAG CAGCAAAAAT CGTAGAACAA 1104 0 

TTCCATGTTG CACATATCTC AACAGGTGAT ATGTTCCGCG CTGCAATGGC AAATCAAACT 11100 

GAAATGGGTG TTCTTGCTAA GT CAT AT ATT GACAAGGGTG AATTGGTTCC TGACGAAGTT 11160 

ACAAATGGAA TCGTAAAAGA ACGCCTTTCA CAAGATGATA TTAAAGAAAC AGGATTCTTA 112 2 0 

TTGGATGGTT ACCCACGTAC AATTGAACAA GCTCATGCCT TGGACAAAAC ATTGGCTGAA 112 8 0 

CTTGGCATTG AACTAGAAGG TGTTATCAAT ATTGAAGTGA AC C C T G AC AG CCTTTTGGAA 113 4 0 

CGTTTGAGTG GGCGTATCAT CCACCGCGTA AC T GG AG AAA CTTTCCACAA GGTCTTTAAC 114 00 

CCACCAGTTG ACTATAAAGA AGAAGATTAC TACCAACGTG AAGATGATAA GC CTG AG AC A 114 60 

GTAAAACGTC GTTTGGATGT TAATATTGCT CAAGGAGAAC CAATCATTGC TCACTACCGT 1152 0 

GCCAAAGGTT TGGTTCATGA CATCGAAGGT AATCAAGATA TCAATGATGT C T TC T C AG AT 115 8 0 

ATTGAAAAAG TATTGACAAA TTTGAAATAA AGCGTTTTTC ACACTTGCAA AAATCCGCTA 1164 0 

CAAATGTTAT ACTGAGATAG TCTGACTTAT AATTGTTGTC TCTGTGTCTA GAGGCATCGA 117 00 

ATCGAAATTT ATGGAGGTGC TTTTGCGTGG CAAAAGACGA TGTGATTGAA GTTGAAGGCA 117 60 

AAGTAGTTGA TACAATGCCG AATGCAATGT TTACGGTTGA ACTTGAAAAT GG AC AT C AG A 1182 0 

TTTTAGCAGG G 11831 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 66: 

CCCGGCATTT GAAAGCTATT CGTGAAGGAT TTATGATGGC AATGCCTTTG ATTTTAGTCG 60 

GCTCTTTATT TCTTATTCTA ATCAGTTGGC CTCAAGAGGC TTTTACAAAT TGGCTGAATA 12 0 

GTGTTGGATT GCTAAGTATC TTGACAACTA TGAATCAGTC AACAGTAGCG ATTATCTCCT 180 

TGGTCGCTTG TTTCGGTATT GCCTACAGGT TGTCGGAAGG ATATGGTACA GATGGTCCGT 24 0 

CGGCAGGGAT CATAGCCTTA TCCAGTTTTG TATTGATGGC ACCTCGTTTT TCGAGTATGG 300 

TTTATGATAA AAATGGGGAG CAGGTCAAGC AGTTATTTGG CGGCGCAATA CCATTTTCTA 3 60 

GCCTGAATGC ATCTTCTTTG TTTATGGCGA TTACTATTGG ATTGGTTACA GCAGAGATTT 42 0 

ATCGTATGTT TATCCAGCGC GGAATTACGA TAAAAATGCC AAGTGGTGTC CCAGATGTAG 4 80 

TAAGTAAATC ATTTTCAGCT CTTTTATCTG GTTTTACTAC TTTTGTTTTG TGGGCTTTGG 54 0 

TCTTAAAAGG TCTTGAAGCG GCAGGAGTTG CAGGAGGTCT CAACGGACTC CTAGGTGCAA 60 0 

TTGTTGGAAC ACCGCTTAAG TTAATTGCAG GAACGCTTCC AGGTATGATT CTATGTGTTA 66 0 

TTGTAAACTC ATTCTTTTGG TTCTGTGGAG TTAATGGGGG ACAAGTTTTA AATGCTTTTG 72 0 

TAGACCCAGT TTGGTTACAA TTTACTACAG AAAACCAAGA AGCTGTGGCT GCAGGACAAA 780 

CACTCCAACA CATTATTACA TTACCGTTTA AAGATTTATT TGTATTTATT GGTGGCGGTG 84 0 

GAGCGACTAT TGGTCTTGCG ATTTGTCTCT TCCTATTTAG TAAGAGTCGT GCGAATAAAA 9 00 

CATTAGGTAA GCTAGCTATT ATACCGTCTA TTTTTAATAT CAATACAGCT ATTCTATTTA 9 60 

CGTTTCCAAC AGTTTTAAAT CCGATTATGC TGATTCCGTT TATTGCTACT CCTACAATCA 102 0 

ATGCCTTGAT TACCTATGTA TCAATGGCTG TAGGATTAGT ACCCTATACA ACAGGTGTAA 1080 

TCCTTCCGTG GACAATGCCA CCGATTATAG GAGGCTTCCT TGCAACAGGG GCTAGTTGGC 114 0 

GAGGAGCTCT ATTACAAGTT GTTTTGATTT TGGTTTCTGT AGCAATTTAT TATCCATTCT 12 00 

TCAAAATTGC AG AT AAACG C AATCTTGAAA AAGAAAAAGC TACTGTTGGA GGGAAATAAG 12 6 0 

ATGGTTATCA GAGTATTTGA TCAACAGAAA AATACTTATT CTAGCTTTGC CTTAGAGGAA 13 2 0 

TTAAGTTACT ATATGAATCG GGTCTTTAAG ACTAACATAG AGCTTGTCGA GGAGAAGGAA 13 80 

GCGGATATTT TTGTAGGATT AGT C AAT AAA GAGGACAGAA AAGACCATGT TCTTATCTCA 144 0 

TTAGACAAGG GTAAGGGGAG AATTGAGTCT AATACAATTG TAGGTTTACT TAT T GG AAT T 1500 

TACCGAATGT TTCATGAATT TGGGGTTGTG TATACTAGAC CAGGGCGCAG ACATGACTTT 15 60 

GTTCCAGAGT TACGATTTGA AGATTTTTTA GATAAACAGC TATCTATAGA TGAAACAGCC 1620 
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AGTT ACT AT C ATAGGGGAGT ATGTATAGAG GGAGCGGATT CATTTGAAAA TATACTAGAT 1680 

TTCATTGATT GGCTACCTAA GATTGGGATG AACAGTTTTT TCATCCAGTT TGAAAATCCT 1740 

TACTCTTTTT TGAAACGTTG GTATGAACAT GAATTTAATC CAT AT C T AAA TAAAGAACAA 1800 

TTTTCAAATG AATTAGTACA AGAATTGAGT GATAGGTTGG ATAAAGAATT GCAAAAAAGA 1860 

GGTCTTATTC ATCATCGTGT TGGTCATGGA TGGACAGGTG AAGTTTTAGG TTACTCTTCA 192 0 

AAATTTGGCT GGGAATCAGG TCTTAGTATT T C AG AG GAG A AGAAACCCTA TGTCGCTGAA 1980 

ATAAACGGGA AACGAGAATT GTTTAATACG GCTCCGATTT TAACCAGCCT GGATTTTTCA 2 04 0 

AATCCAGATG TAGCTGATAA GATGGTAGAA ATTATCAAGG ATTATGCCAA GAAAAGACCT 2100 

GATGTTAACT ACTTACATGT ATGGTTGTCG GATGCTCGTA ATAATATTTG TGAATGCGAA 2160 

AACTGTAGAC AAGAATTGGT TTCGGATCAG TAT AT T CGT A TTCTCAATCA AT TGG AT AGG 222 0 

GCTTTAACGA GTGAGGGATT AGATACAAAG ATTTGTTTTC TGCTTTATCA TGAGTTGTTA 2 2 80 

TGGGCACCTC AGAAAGAAAA AT T AG AT AAT CCTGAACGCT TTACCATGAT GTTTGCACCG 2 34 0 

ATTACAAGAA CATTTGAAAT GAGTTATGCA GATGTAGATT TTGACAATTC CATACCTACG 2400 

CCTAAACCTT ATATGCGTAA TAAAATTATA CTTCCGAATT CTCTTGAGGA AAATTTATCT 2 4 60 

TATCTTTTTG AGTGGCAAAA AGC AT T T AAA GGAGATAGTT TCGTATATGA CTATCCTTTA 2 520 

GGGCGTGCTC ATTATGGCGA TTTAGGCTAT ATGAAAATTA GTCAAACTAT TTACAGAGAT 2 58 0 

GTATCTTATC TTTCCAACCT ACATTTGAAC GGGTACATTT CGTGTCAAGA ATTACGTGCC 2 64 0 

GGATTCCCTC ATAATTTTCC TAATTATGTC ATGGGGGAAA TGCTCTGGAA GAAGACAAGA 2700 

AGTTATGAAG AATTGATTGA AG AAT AC T T T TCTGCTTTGT ATGGGGAAAA TTGGCAGTCT 27 60 

GTTGTTGAAT ATTTAGAAAA ATTATCCATT TATTCCTCTT GTGATTATTT TAATGCAATT 2 820 

GGCAGCCGTC AAAGTGATGT TTTAGCGAAT CATTATTATA TAGCTTACAA TCTAGCTGAT 2880 

AATTTTTTAC CAATTATTGA GGAAAATATT TCTAAGTTAT TAAATAGTCA AAAGGATGAA 2 940 

TGGAAACAGC TCAGTTATCA TCGTGAATAT GTTGTTAAGA TGGCGAAGGC TTTATATCTT 3000 

CAAGCAACTG GAAAAACAAG GCAAGCTCAA GATGAATGGA GAAATGTGTT G AAT TAT AT C 3060 

CGTGGGCACG AATTGCTATT TCAATCTAAT TTGGATGTTT ATCGTGTAAT TGAAGTAGCA 312 0 

AAAAATTACG CTGGTTTCCA CTTATAAATC ATAAGTATAG AAAATGAACT AAGGTATTCA 3180 

GAGAAGATTG AT CCT AAAT A TTATGAAATT TAAGGATTTT TAAGATATTT AGGGTCAACT 3240 

TTCTATTTAT ATCGTAGCGA AGTCATTTTA ATAATGATGT GTAAAAGATG GATCAAGATT 33 00 

GAGGAGGAAG AAAGATGAAA TCAAAAGAAG AAAT AAAT AT GCTTGGTTTT ACAATTGTCG 33 60 
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CTTACGCAGG 


AGATGCAAGG 


TCAGATTTGA 


TGGATGCTTT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


ATTTTGAACA 


GGCAAGAGAA 


TTGGTTGAGT 


CTGCAAACGA 


CTCAATAGTG 


TCTGCCCATC 


3480 


GAGAACAGAC 


TAATTTATTA 


GCGGAGGAGG 


CATATGGAGA 


TAATTTTGAA 


GTGAGCTTTA 


3540 


TTATGATTCA 


TGGTCAAGAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGTAAAGT 


3600 


TTTTTATTGA 


TGAATATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACATATTGGT 


TTGCAATGAG 


3660 


GAT TAG TC AT 


GGAAAATTTA 


CAGGTTAAAG 


CCTTACCGAA 


GGAGTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GAGGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3780 


GGGATGTTTA 


TTTGCAAGAA 


AATAGTCCGT 


TCTTACCAGA 


TCCAGCTAGT 


GATTTTTATT 


3840 


ATCGTTACGA 


AGAGGATATA 


GCTTTGGCGG 


CAGAACATGG 


TTTGCAGGCT 


TTGCGTTTAT 


3900 


CTATTTCTTG 


GGTTCGTATA 


TTTCCTGATA 


TAGATGGGGA 


TGCTAATGTA 


TTAGCTGTTC 


3960 


ATTATTACCA 


TAGAGTTTTT 


CAGTCTTGCT 


TAAAACATAA 


TGTGATTCCG 


TTTGTTTCTT 


4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAATGT 


TAGAAACAGG 


GGATTGGTTG 


AACAGAGAGA 


4080 


ATATTGATCG 


T T T CAT AC G A 


TATGCTCGCT 


TTTGTTTCCA 


AGAATTTACA 


GAAGTCAAGC 


4140 


ATTGGTTTAC 


AATCAATGAA 


CTGATGTCTC 


TTGCTGCAGG 


T C AAT AT AT A 


GGAGGTCAGT 


4200 


TTCCTCCAAA 


TCATCATTTT 


CAATTATCTG 


AAGCAATTCA 


AG CG AAT CAT 


AATATGTTGT 


4260 


TGGCGCATGC 


TCTTGCAGTC 


CTCGAATTTC 


AT C AAT TAG G 


GATTGAGGGA 


AAGGTAGGTT 


4320 


GTATTCATGC 


TTTAAAGCCA 


GGCTATCCTA 


TTGATGGGCA 


AAAAGAAAAT 


ATTTTGGCAG 


4380 


CTAAACGGTA 


TGATGTTTAT 


AATAATAAAT 


TTCTATTAGA 


TGGAACTTTT 


TTGGGCTACT 


4440 


ACAGTGAGGA 


CACGCTTTTT 


CACTTGAATC 


AAATATTGGA 


AGCTAATAAT 


TCTAGCTTTA 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA 


AG AG AG CTGC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


GGATGAATTA 


TTATCGTTCA 


GAATTTATTC 


GTGAATACAA 


AGGTGAAAAT 


AGACAAGAAT 


4620 


TTAATTCAAC 


AGGAATAAAA 


GGACAGTCTT 


CTTTTAAATT 


AAATGCTCTA 


GGTGAATTTG 


4680 


TAAAAAAACC 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TATTTATCCT 


CAAGGGTTAT 


4740 


TTGATATGTT 


GCTTCGTATC 


AAAGAAGAAT 


ATCCTCAACA 


TCCGGTCATT 


TATTTAACTG 


4800 


AAAATGGTAC 


AGCCCTTAAA 


GAAGTTAAGC 


CAGAGGGCGA 


G AAT GAT AT T 


ATTGATGACA 


4860 


GTAAGAGAAT 


CCGTTATATT 


GAGCAACATT 


TACACAAAGT 


TTTAGAGGCT 


CGAGATAGAG 


4920 


GAGTCAATAT 


TCAAGGCTAT 


TTTATATGGT 


CTTTGCAAGA 


TCAATTTTCT 


TGGGCGAATG 


4980 


GCTACAATAA 


GCGATATGGT 


CTTTTCTTTG 


TTGATTATGA 


AACACAGAAG 


AGATATATTA 


5040 


AGAAAAGTGC 


TCTTTGGGTA 


AAAGGGCTAA 


AACGG AAT T A 


AGGTTAGCGA 


TTTGACTGAT 


5100 


GTTTAATATG 


TTTTAAATAT 


GAGGTTGAAT 


TTTTTATAGG 


AGGAGTTTTA 


TGGATAAGCT 


5160 
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AGTCGCTGCC ATTGAAAAGC AACAAGGGAA ATTTGAAAAA ATTTCTACTA ATAACTATAT 52 2 0 

GATGGCTATT AAAGATGGAT TCATTGCTAC TATGCCTTTA ATTATGTTTT CAAGCTTTTT 5280 

GATGATTATT ATTATGATTC CTAAAAATTT CGGAGTAGAG TTACCGAGTC CAGCTATTGT 534 0 

CTGGATGAGA AAAGTGTATA TGTTAACCAT GGGAGTTTTG G GT AT T AT TG TTTCAGGGAC 54 00 

TGTTGGAAAG TCATTAGTTG GAAATGTTAA CAGAAAAATG CCTCACGGAA AGGTAATAAA 54 60 

T G AT AT TT C T GCAATGTTGG CAGCCATATG TAGTTATCTG GTATTAACTG TAACGCTTGT 552 0 

AGTTG AT GAG AAGACGGGAT CTACAAGTTT GTCGACAAAC TATTTAGGAT CTCAAGGATT 55 8 0 

GATAACTTCG TTTGTCAGTG CCTTTATTAC TGTAAATGTT TACCGATTCT GTATTAAGCG 564 0 

AGACATTACT ATTCATTTAC CTAAGGAAGT TCCTGGGGCT ATATCACAAG CTTTTAGAGA 57 00 

TATTTTCCCT TTTTCTTTTG TTTTACTTAT TAGTGGTTTG T TAG AT AT T G TATCTCGGTT 5760 

TAGTTTAGAT GTTCCTTTTG CCCAAGTATT TCAACAACTA TTGACTCCTA TTTTTAAGGG 5820 

GGCAGAATCA TATCCTGCTA TGATGTTGAT TTGGTTTATG TGTGCTTTGC TTTGGTTTGT 5 880 

TGGAATTCAT GGACCATCTA TTGTCTTACC TGCTGTTACA GCTTTGCAAC TGAGCAATAT 594 0 

GGAAGAGAAT GCTCAACTTC TTGCAAATGG GCAGTTCCCT TATCATTCTT TAACACCTAA 6000 

TTTCGGGAAT TATATCGCTG CTATTGGAGG AACGGGGGCT ACCTTTGTTG TACCATTTAT 6060 

TTTGATTTTC TTTATGCGGT CTAAACAATT AAAATCGGTA GGTAAAGCTA CAATTACTCC 612 0 

TGTTTTATTT GCGGTAAATG AACCTCTTCT ATTTGGTATG CCTGTTATTT TGAATCCCTA 6180 

TCTTTTTGTC CCTTTTTTGA TGACTCCACC AGTGAATGTA TTTCTAGGAA AGGTCTTTAT 62 40 

TGATTTCTTT GGAATGAATG GATTTTATAT CCAGTTACCT TGGACCTTTC CTGGTCCCTT 6300 

GGGATTGTTA ATTGGAACGA ATTTTCAACT TATCTCCTTT GTATTTTTAT CTTTGATTTT 6360 

AGTTGTCGAC ATATTGATTT ATTTGCCATT CTGTAGAGCG TAT GAT AG AC AGTTACTGGT 642 0 

GAAAGAAGAT ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGATACAA GTGAAATAAT 6480 

TCCTGGTGAG AT AG AT G AAA TAAAAAGTAA GG AGTTG AAA GTACTGGTTC TTTGTGCAGG 6540 

GTCTGGAACA AGTGCGCAAT TAGCCAATGC AATTAACGAG GGGGCTAACT TAACAGAGGT 6 600 

TAGAGTGATT GCGAATTCAG GAGCGTACGG AG CT CAT TAT GATATTATGG GTGTTTATGA 66 60 

TTTAATTATT CTGGCCCCAC AAGTTCGGAG TTATTATAGA GAGATGAAGG TGGATGCAGA 6720 

AAGATTAGGT ATTCAGATAG T TGCT AC C AG AGGAATGGAA TATATTCATT TAACAAAGAG 6780 

TCCAAGTAAA GCCTTACAAT TTGTATTGGA GC AT T AC C AA GCTGTGTAGT AAGTTTTTCC 684 0 

ATCTTTTATT TGAGTAAAGA TTTTGTTTAC AG AT AGG C T T GGATTTAAAA ACGTTCCCCC 6900 
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TTTTTTAATA TAAGAATCCC TCTTTCACAA TTGTAAAAAG AGGGATTTTG TATTTTATCT 6960 

CTTAGACCAA GTTCTCTTCA TAAAGAGAAG GAGGATTGGG TAAATCTCCA AGCGCCCTGC 7 020 

AATCATTGCA AAGGATAGGA GAATTTTTGA GATGGGACTA AAG AT T G AG A AACTAGAAGT 7 08 0 

GGTTCCTAGA ATAGGCCCGA TATTATTGAA ACAGCTAAAG ACAGCGCTGG TC AC G AC C AG 7140 

AAAATC AT TG CTATCTAGGC TGACAATAAA G AT AAG CG CT AGCAAAATCA TAGCATAGAT 72 00 

GACAAAGTAC TTGAGAATCT TATGCTGGGT ATCTTTGTCA ATCACCGTTT TATTAACATG 72 60 

G AGGGT C AAA ACACGGTGGG GCGATAGGAT TGACAAAATT TGGTTTTTGG CAATTTTTGA 73 2 0 

AAGGATGAGG CCTCGAATAA TCTTGAGTCC ACCTGCAGTT GAT C C AG C AG AGCCACCGAT 73 80 

TGCCATGAGG AAAAGGAGGA TAAACTGGGA GAAGAGGGGC CAGTTGGTAA TATCTCCATA 744 0 

TCCAAAACCA GTTGTTGTAA TGATGTTGGA AACCTGGAAG AAGGTCATTT CAAAGCTCTT 7500 

TGAAAACCCT GGGTAGAGGT AGAGGGTGTT GAGGCTAATC AAGCCTGTAG AAACCAGTAC 7560 

AATGACCAAG TAAGCCCTAA GCTCTTCATC TCCAAAGAAG GCCTTGATGC GACGGAGCAT 7 62 0 

GAGGTAGTAG TAGAGGTTGA AATTTACTCC AAAAACCAGA AC T C C GAT AC TGACCAGATA 7 680 

GGTAATCAGT GAGCTGCCAT AGTGGGCAAT TCCGTCGTTA TAG AC G G T AA AGCCTCCAGT 774 0 

TCCCGCTGTC CCCATAGCAA TAACAAAACT ATC GT AG AG A GGCATACCGG C T AG AT AAT A 7 800 

GATGATGACA AAGAGGGAGA AGAGAGCTAG ATAAAGGAGA TAG AG AAT C T GGGCAGTGTT 7 8 60 

TTTTAGTTTG G AT AC AACC T TGCCAAAAAC AGGACCTGGA ACCTCAGCCT TCATCACCTC 7 92 0 

TAGGTGGCTA TTTTTGGCAT TGTCCATAAT AGCAAGTGCA AAAACAAGCA CTCCCATCCC 79 8 0 

TCCAATCAAG TGGGTAAAAC TTCGCCAGAA GAGGAGGGAA CGGCTGAGAA CCGAAACGTC 8040 

GTTCAAAATA CTTGCTCCAG TAGTTGTAAA TCCAGAACTA ATTTCAAAAA AGGCATCAAT 8100 

AAGGCTGGGG ATTTGCCCAG AAAAGACAAA GGGGAGACCA CCAAAGAAAG ACCAAAGGAT 8160 

CCAACAGAGG GCAACGATCA AGACTCCCTC CTTGGCATAA ATCCGTTGAT TTTTTGGCTT 822 0 

CTGTAAACTC CCTGAACCGC CTAACAATAC GAG AAT C C CT ATGGTCGAAA AGAGGGCTGT 82 80 

AAAGACTTGG CTCGATTCAC GGTAATAGAC AGCAATCGCA ACAGGAACCA AAAGAAGAAC 8340 

AGCTTCAATC AAAAGTAATT TTGAAAGGAG GTAACGAATC ATACTTTTAT TCATTTCTTA 84 00 

CCTCGCGATC AAGTCATAAA TCTTGGTGAT GTTTGGCAAC AAGGTTGTTA CTAGGAGCTT 84 60 

GTCTCCAACT TCCAACATAT CCTCCCCAGT TGGGAAAATA GTCTTGCCCT TTCGAATAAT 8520 

GGCTGCAATA AGAACCCCTT TTTTCAATTT CAGTTGAGAA AGAGGTTTGG CAGTCATTTT 8580 

ATTGGCTTCC TTGATATGGA ATTGCAGGGT TTCGATTTGG CCATTGGCTA GATGGTGCAT 8 640 

AGCTTGAAGG TCTGAATACT GGGCATTAAC T CG AC C AC G A ATAAAGTGCA T AAT C GT ATC 8700 
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T AC AG CG AT G CTTTTAGGTG TGATGATACT TGAAAAATCA GGCGCATTGA TAATCTCGAG 87 60 

GAGACTGGTA CGATTGACCT TAGTAATATT TTTCTGTACA CCTACCCTGT CAAGGAACAT 882 0 

AGATGTAATC AGATTTTCCT CATCGACTCC TGTTAGAGTC GCAACGGCAT CATAGTGTTG 8 880 

AGCACTTTCT TCCAGCAGGA TATCTTTTGC GGTTCCATCT CCTTGAACGA TGTAGAGATT 8940 

TGGGAATTTC TCGCTAAAGA AGCTGGCGAT TTCAGGATTG ATTTCAATGA CTTTTGTATC 9000 

GATACGACTA TCTTTGAGAA TACCAAGTAG ATAATAGGCA ATTCTACCTG CCCCAACGAT 9060 

GAGAAGGCTC TTCACGGCGC GTGATTTAAA ATAATTATGG AAGAGTATCA TATCGACACG 912 0 

GTTACCAGTG AC AAAG AT TC TATCTTTATC CTGTACAGTC ATGTCACCGC TTGGAATGAT 9180 

AATTTGATGA TCCCTCTCTA TCGCACAGAC AATGACATTA CCAAATTTTT TACGAAAATC 9240 

AGAAATGGGC ATTTGGCAAA GACCGCTGGT GGACTTGACG ACAAATTCCA TGAGGCTAAC 93 00 

GCGTCCACCA GCAAAGCGTT CGACAGACAG GGCGTTGGGG AAGTCAATGA TATTCGCGAT 9 3 60 

AGCGCGGGCA GCCAAGAGCT CAGGATTAAC GATAAGAGAA AAACCGAGAA TATTCTTTTC 942 0 

CTTGAAATAA GAGTTAGAAT ATTCAGGGTT CCGCACCCGA ACGATAGTTT CTTTAGCTCC 94 80 

CATTTTCTTG GCTAGAACTG CTGCAATCAT GTTGACTTCA TCGTGCTCAG T C AGGGCG AT 9540 

AAAG AT AT C A CAATCTTGGA CGCTGGCTTG CTCAAGAATG GCAAAATCGG CCCCGTTACC 9600 

AAGGATACCA ATGATATCAA AGCGACTGAC AATATGATTG AGAACAGCTT CGTCTTGCTC 9 660 

AATCAGCAAA ACATCATGCT TTTCTGCAAC CAAGGAGCGA CAGAGGGCAA AACCAACTTT 972 0 

TCCCCCTCCG ACAAGGATAA TTTTCATAAT AAAACCTACT TTTTCATGAT GTAACTATCA 97 80 

TACCCTTTTT CAAGAAAAAA TGCACCTACT AGCTAATAAC AAGAGTTTTT AGTGAAAATT 9 840 

CGCTATAAGG TAAAACTATA CCCTAACCAA TTGAAATAGC TAT TAG CG AC TTTCTCTGAA 9900 

AT ATGGT AT G ATAAAGGATA TACAAGGAGA TAAAATGAAT AATAATTTAC TGGTATTACA 9960 

ATCAGACTTT GGTCTGGTTG ATGGTGCGGT ATCGGCTATG ATTGGAGTGG CTTTAGAAGA 1002 0 

GTCTCCAACC TTAAAAATAC ATCACTTGAC G C AC GAT AT C ACGCCTTATA ATATTTTTGA 10080 

GGGGAGCTAT CGTCTCTTTC AGACGGTGGA TTACTGGCCT GAGGGAACGA CGTTTGTATC 10140 

GGTTGTCGAT CCAGGTGTCG GTTCGAAACG TAAGAGTGTA GTTGCCAAGA CTGCAAAAAA 10200 

TCAATACATT GTCACGCCAG ATAATGGGAC GCTTTCCTTT ATCAAGAAAC ACGTTGGCAT 10260 

TGTAGCCATT CGTGAGATTT CTGAGGTGGC CAATAGGCGT CAAAACACAG AGCATTCTTA 10320 

TACCTTCCAC GGTCGTGATG TCTATGCCTA TACTGGTGCT AAACTGGCCA GTGGTCACAT 103 8 0 

TACTTTTGAG GAAGTAGGGC C AGAGCT C AG TGTGGAACAG ATTGTAGAGC TTCCAGTCGT 10440 
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AG CG AC CATC AT AG AAG AT C ATCTGGTGAA GGGAGCCATT GATATTCTGG ATGTGCGTTT 



10500 



CGGTTCGCTT TGGACCTCTA TCACACGGGA AGAATTTTAC AAGCTGGAAC CAGAATTTGG 



10560 



TGATCGTTTT GAAGTGACCA TCTATCATGC TGATATGCTG GTCTATCAAA ATCAGGTTGT 



10620 



CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGGCAACCs ATcTTTACrc TCAGCaTCTt 



10680 



CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 



10726 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGG GATTGGGGCT TATCGTGAAG 60 

TTGGCATCCA AGCCCATGGA TATGTCTGTG ACGTGACAGA CGAGGACGGT ATCCAAGCCA 12 0 

TGGTCAAGCA AATCGAACAA GAGGTTGGTG T C AT TG AC AT CCTCGTTAAT AACGCTGGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCGCTGA TTTCCGTAAG GTCATCGATA 24 0 

TTGACTTAAA CGCACCATTT ATCGTTTCAA AGGCAGTTAT TCCTTCTATG ATAAAGAAAG 3 00 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGAGCGA ACTGGGACGT GAAACAGTTA 3 60 

GCGCTTATGC TGCTGCTAAA GGGGGCTTGA AAATGTTGAC CCGCAACATT GCGTCTGAAT 42 0 

ACGGTGG AG C CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 4 80 

CAGCACCTCT TCGTGAATTG CAAGAAGATG GTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

TTGCAAAAAC ACCTGCTGCA CGTTGGGGAA ATACTGAAGA TTTGATGGGC CCTGCTGTCT 600 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGGCCACAT CCTATATGTA GATGGCGGTA 6 60 

TCTTAGCCTA CAT C GG AAAA CAACCTGAGT AAAAATAGAA AG AAG AT C T T ATGAAAATCG 72 0 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACGAT AGTCTAAAAG 7 80 

AAGCGACAGA TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGGA GAAGAAGGAG 840 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGGCTGC CATCCTTTTA AATACAAAGG 9 00 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CGGGTGTAGG GGCTATGCTT GCTTTAAACA 960 

GCTTCCCTGG TGTTGTCTGT GGTCTAGCAG TGGACCCAAC TGACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGGTAACGCC TTGTCTATCC CTTATGCCAA AGGATTTGGC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTGCTGA AGAAATGGGC GGTGGCTACC 1140 
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CAAGAGAACG TGTAATCCCT GAACAACGCA ACGCTCGTAT CTTAAACGAG GTGAAACAAA 12 00 

TCACCCACAA TGATTTGATG ACCATCCTTA AAATAATCGA CCAAGACTTC CTCAAAGACA 12 60 

CCATCTCTGG CAAATACTTC CAAGAATACT TCTTTGAAAA CTGCCAAGAT GATGAAGTTG 13 2 0 

CTGCTTATTT GAAAGAAGTA T TAG C C AAGT AAAGCTATTC TAAACCAGAA AGGAACTAAT 13 8 0 

GGATGACGAA AATATTACTG TTTGGCGAAC CATTAATTCG AATTTCACCA TTAGATGCCA 14 4 0 

CCAGTATCGG CGATCATGTT GCCAGTTCGA CTTATTTTGG CGGATCAGAA ATTAACATCG 15 00 

CTTGTAATTT GCAAGCCCTG GGTATCTCAA CGAAAGTTTT TACCGCACTC CCTGCCAACG 15 6 0 

AGATTGGAGA TCGTTTTCTC ACATTCTTGA AACAGCACCA AATCGATACC AGTTCAATCT 162 0 

GTCGGCTTGG CGATCGAATC GGCCTCTACT ATTTGGAGAA CGGCTTTGGT TGTCGTCAAA 16 8 0 

GTGAAGTTTT CTACGATCGT AAGCATACGA GT AT C AG CCA GATTCGGCCA AACATGCTAG 1740 

ATATGGATTC TCTCTTTCAG GGGATTAGCC ATTTTCATTT TAGTGGAATC AC CGT AG CT A 1800 

TCGGTCAAGA GGTCCGTGCG ATCCTTCTCC TACTCTTGGA AGAAGCCAAG CGCCGAGGAA 18 60 

TTGTCGTTTC AATGGATCTC AATCTGAGAA CAAAGATGAT TTCAGTCCTA GAAGCCAAGT 192 0 

ATGAATTTTC TAAGTTTGCA CGTTTTACTG ACTATTGCTT CGGTATTGAT CCTCTCATGA 1980 

TTGATGACCA AAATCTAGAG ATGTTTCCAA GAGACAGTGC TAGCCTAGAA GAGGTGGAAA 2 04 0 

ATCGCATGCG ACTTTTAAAA GAAGCCTATG GTTTCAAGGC CATTTTCCAT ACCCTCCGCT 2100 

CTAGTGATGA GCAAGACAAA AATGTCTATC AAGCCTATGC TCTAGAAGAA CTATTTGAAG 2160 

AGTCTGTCCA ACTAAAAACT GCAGTCTATC AACGAATTGG TAGCGGGGAT G C CT T TAT AT 222 0 

CTGGTGCCCT TTACCAACTA CTCCATCATT CCTCCCTAAA AACTACCATT GACTTTGCAG 22 80 

TTGCGAGCGC AACTCTCAAA TGCACTCTTC CAGGAGACCA TCTCTCCACT TCCTCAACTA 2 340 

GTATTGAAAA TTTACTGGCA AATGCACAAG ATATCATTCG TTAGGAGAAT TACATGACCA 2400 

AATCAGATAC GATTATTGAA CTAAAAAAAC AAAAAATTGT CGCTGTTATT CGAGGAAATA 24 60 

CAAAGGAAGA AGGACTACAA GCCTCGATTG CTTGTATCAA GGGCGGTATC AAAGCTATTG 2 52 0 

AAATCGCCTA TACCAATCAG TATGCAGGAC AAATCATCAA GGAACTTGTA GACTTGTATC 2580 

AGGACGATCA GAGTGTTTGT ATCGGTGCAG GTACTGTGCT TGATGCCGTA ACTGCTAGAG 2 64 0 

ATGCCATTCT AGCTGGAGCA AATTACGTTG TTTCTCCATC TTTCCATGCT GAAACTGCGA 2 700 

AAATGTGCAA TCTCTACAGC ACACCGTACA TTCCAGGCTG TATTACCCTC AC AG AG AT C A 27 60 

CGACTGCACT TGAAGCCGGT AGTGAAATCA TCAAACTCTT CCCAGGTAGT ACTCTCAGTC 282 0 

CAGCATATAT CTCTGCAGTC AAGGCACCGA TCCCACAAGT TTCCGTAATG GTAACCGGAG 2880 
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GAGTCGGCCT AAACAACATC CCTCAATGGT TCGCTGCTGG TGCAGATGCC GTTGGAATTG 2 94 0 

GTGGCGAACT CAATAAACTC GCTTCCCAAG GCAACTTTGA CCGCATCAGC GAGATTGCCC 3 000 

AACAGTATAT TACACTCAGA T AAAAT C AT A ACTACCCGTC TAACGGGTGG TTTATCTCAG 3060 

AGCTATAAGC C C AAAT CAT C AGCCAGCGCC TAAAGACGCT GGCTTTCACG TTGTTCAAGC 312 0 

CTTATTGCTC TTGACTCGTC ACTTGCCTCT TTAAGAGACT TTGGTATTAC TTACCACTAT 3180 

CCCTAAAGGG AT C C T CAT AT TCTTTTACAC TCAATTTATC T AG TGCT AT A GTAGATTGAA 3 24 0 

ACTGGAATAG TACACCTCTG CTTCTAAAAC ATTGTTAAAA ATCGATTTGA CTGTCCTGAT 3 3 00 

CGATTTTGTC CTGTTCTTAT TTCATTTTAC TATATATCAT ACTTTACTCG TTCTCAAATT 33 60 

TTCATACTCA TGAAGAAATC ATCCACTCGA TAATTTCTTT AATCTTGACT ATATTTCTTA 3 42 0 

ATTGTGGCTT CATTAAGCCC TACTGGACTT ACATAATAAC CTTCCTCCCA GAAATGCCGA 3480 

TTCCCAAACT TGTACTTGAG ATTGGCGTGT TTGTCAAACA TCATGAGTGC ACTTTTGCCT 3 54 0 

T T T AAAT AC C CCATAAAACT TGAAACACTT AGCCTCGACG GAATACTGAC TAACATGTGT 3 600 

ACATGGTCTG GCATTAAGTG ACCCTCGATC ATTTCAACAC CTTTATAACT AC AC AAG C G A 3 660 

TGAAATATTT CGTCTAAACT ACTTCTATAT TGAT TAT AG A TGACTTTTCG TCTATACTTA 3 72 0 

GGGGTGAACA CAATATGATA GAACACCTCC ACTTTGTGTA TGATAAACTA TGAGTCTTTT 3 7 80 

GTGCCATATT TTTTCTCCTT TCGCTTTACA ATTGGATTGA ACACCTTTAT TGTATCGCGT 3 84 0 

TTGGAGTTTT TTTGGTATAA CCTTCGACGC GCACCCGTAT AGCGGGTGGT TGTTTTGTCT 3 9 00 

CGCACCTCAC GG AG CGAG AC GG AC TAAT AT AGTGGAGTGA AATAGGATAC GAACAAATTG 3 960 

ATTAGGAAAA TCAAATGAAT TTATAGAAAT CTTTTAGCAG TTATAACGTT CTATTCTAGT 402 0 

TTCAAAACGC TATAGTCACA TAATAATGAA GTAAAAAAGG ATAAGTATCA ACTTATCCTT 4 08 0 

TTTTAAAAGA AAAATCCGAA GATATTTGGC CTTCTTCGGA TTTTTTCTAT TTTCCACAGT 414 0 

TTCATGTAAT TCATCTAGAT GATGAACAAA TTAGTTGTTC TTTCCTCTAC G G AAT AG AT A 42 00 

AAATGCCCCA AGTAGCAAGA ACCCTAGACT TGCCAAGATT GACTGACCTT CTCCTGTCTG 42 60 

AGGGAGATTC TTTTGATCCG AATGGTTCTT TTCCTCTTCA GATTTTTCCT TTTCTTTTGA 43 20 

ATTCTGTACT TGTGGCTGAG CTGCTTGCTC TAGCTTTTTA AAG AC T TC C T GATCTGGAGC 43 80 

TGATTCCTGG GTTTCAGGAT TATAGTAGGC AATCTTATAT TCATCCCCTT CTTTTCGAAT 444 0 

GGTATAGACT CCACGTTTCA AAACTTGGAA TTGGTTGGAA ATAGTAGAGA C AG AATC AT C 4500 

ATATTTCACA ATGCCCCAAA CTCCTTGTTT AG CATC AT AA ACAGACTGAA GGGTTTCGTT 4 5 60 

ATTTTCGATG AGGCTACTTT CTAACTCTTT TATCATTTGA TTGAAGGTGG CACGATCCAC 462 0 

GTTAGGAATG AGC AT AT AG C CAT AAG AAT C TCTATTTTGC TTATGAGCCT GACTAATCGT 4680 
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AAGAAATTCA 


TTTTCAACTT 


CCTTGTCTGA 


CTGTCCTTCA 


TTGATATCCT 


TCCAGGCTCC 


4740 


CTTTTGCAAA 


GCCTTACTCA 


TACTGATTGA 


ACTCTTCTTA 


AAGAAAAAGT 


AACCAATATT 


4800 


CTTTTTCGAA 


TCGAACGATT 


CTAAAAAGAC 


ACTTTGGGTT 


TCAGGATAAT 


CCTTTTCTTG 


4860 


TTCTGTAAGG 


GAGGCTTCTT 


TAT CAT T G AC 


ATAGACTTTA 


TAT GG ATT AC 


CTGATTCCAG 


4920 


TTTTCTCTGG 


T C AATTGT AG 


TTGCAGCAGT 


ATCTGTTGAA 


GTGTTTTGGA 


TATTGCTTCC 


4980 


TAAAAAGGCG 


ATCTTATCCT 


TTAGCATAAA 


CCAGCTCTTA 


TGAGCAGTCA 


ATGTTTGATT 


5040 


CCAGTTGGTG 


AAATCCATGG 


TTGCTGTCGC 


ATTGGCATCA 


TCTAGTTTGC 


TCGTTCCAAC 


5100 


GAAAGCAGAC 


GGTAAAACTT 


TACCTGTATC 


GCTATCCGCT 


CTCTTAGCAT 


CCGTCTCTGT 


5160 


TGTACCAGGC 


ATCTTATATG 


GATTAACTGT 


TGGCCAGTAG 


CCATCGCTAT 


AGTGACTCAA 


5220 


ATCGCCATTG 


TAAAGATAGA 


ACATCCCATC 


ACTCGTATAC 


CAACCACGTT 


TATTTTCCTT 


5280 


GTTCATGTGT 


TCGTAATTCA 


AGGTACGACT 


GGAAAAGAGT 


GACAAGCCAA 


ATCCAAACCC 


5340 


TTTCTCTGCA 


TTGTACATGG 


CTGTTTTATC 


CATCTTGTTA 


AAGG C AG AT A 


GGTAACTTGG 


5400 


TCTTGGAACA 


CTTGCGACTC 


CTGCATCACT 


TAACAAGGAT 


TGCATCAAAC 


TGATATCCTT 


5460 


ATAAGTCTTC 


AAATTCTTAA 


AGACATCATA 


AT AACT AT C C 


GATTGAACAA 


TGGTCTTCAC 


5520 


AAGACTCTGC 


AAACATTGTT 


TGGTTTCTCC 


T T C AG AC AT A 


TCCGCTATTC 


GGTGAATCCC 


5580 


TCTTAGTACT 


TCTACTGCGG 


CCACGTGCCC 


CTCGCTATTT 


GCACGACTGA 


TCGAGCGTCC 


5640 


AC G AC TC AT A 


TCCATCAACT 


CTCCATTCAC 


CAGCAAAGGA 


GCAAACGATT 


TATCAATCCA 


5700 


GTGGTACATG 


GTTTGCATTT 


TATCTTTATC 


GATTGGATTC 


TTGGTCTTTT 


GAATGACTGG 


5760 


CAACAGTTGA 


GACAGGCCAT 


CAATCAAAAC 


ATTCCCATAA 


GCACCCGTAT 


AGGCAACATT 


5820 


GGTGTGGTCG 


AT AT AGG AT C 


CATCTTGATA 


AAAACCTTCA 


CCTTGGTCTA 


CCAACTTGAA 


5880 


CACTTGCTCA 


ATCGAGCGAA 


TGGTAGAAGA 


AATTTCTTGA 


T CAT C CTT AC 


GCAGTAAACC 


5940 


AGCTATTACT 


TTTACCCTTC 


CCATATCAAC 


TAAGTTTCCA 


CCTAGAGCCT 


TGAATGGGTT 


6000 


ATCAGTCGTC 


TTTCGGAAAT 


GTTCGGGATC 


TGGTACAAAT 


TTTTCAATCA 


C AT CTGT AT A 


6060 


TTTTTTAATT 


TCCT C AT C AG 


AGAAGTATTC 


TTTCATCAGA 


GACAAGGTAT 


TGTTGATGGC 


6120 


ACGAGGTGTA 


C CG AT T TC AT 


AATCCCACCA 


GTTCCCAACA 


ATGCTCTTTT 


CACTATTGTA 


6180 


GACATGTTTA 


TGCATCCATT 


CCATGGAATC 


CCTGACTGTT 


CGAACGACAG 


TTTCATCTTG 


6240 


ATAATAACGA 


GAAGAAGGAT 


TGGTCACTTG 


CTTGGCCATC 


TCCTCCAATT 


TCCGATAAGT 


6300 


GGCAGTCAGA 


TTTGCAGACG 


TTTTATAATT 


TGAAAATTTT 


TCCCACAAAT 


AGGTGCGGTC 


6360 


CGCCTGACTT 


GAAATACTGG 


AT AGGCT AT C 


AGCTACCTTT 


CCTTCCAATT 


CCTGGTTTAA 


6420 
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TTTGGCCATC TGTTCATTTT TAGAATCATA GTATTGATTC CCAGCGATGA TGCCATTCCA 6480 

GTCATCCAAA CGGTCTGTGT ATGCATCCTT AACAGAGGCC AGAATCTTCA AAGGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT GTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT ACCTTATAAG TATAGTCCGC 66 60 

AAGAGAAAAA ACATGTTTTT TTCCAATTGG TAAATCAATC TTTTCCTCAA GCTGTTTATC 672 0 

TGTTTGAGAA T C CT C AG AAA GCTGGTCTGC TACCTCTACC AGCTCAATAT CCTTAAAGGA 67 8 0 

AACAGTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG ATTTTATCAA CAT C T AAAG T 6 84 0 

CGGGCTATAG TCTGCTTCAA TGGTCTGCCA GTCCTTTGTT CCTGACGTCG TTGCAGAATT 6900 

CCACAATCGC TTGTCCTTAC CACTTTCCTC AATG AT AC G A ACTTTGGCAA TCCCGATTTT 69 60 

ATTATCTGTT TTAATCTTGA AACGCAGTTT ATACTTTTTC TT AG CT T C AA TAGGAACCAT 7 02 0 

ACGGTGAAGC GCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7 080 

CT C AATG ACT CGAGTTGAGG CATCTGCACT ATTCTTCTGG TCTACCCAAG CTGACCACCC 714 0 

CCTGAGCTTT GCTTCCTGTC CGG 7163 



(2) INFORMATION FOR SEQ ID NO : 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9244 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGTTATAACA TACATGTAAG CGGTACCCAA AATGGTGCCA AGTCAAAATT TTTAAGGAGG 60 

AAAATACATG TCTTCACATC CAATTCAGGT CTTCTCAGAA ATTGGGAAAC TGAAAAAAGT 12 0 

TATGTTGCAC CGTCCAGGCA AGGAGTTAGA AAACTTGTTG CCGGACTATC TTGAAAGGCT 180 

TCTTTTTGAT GATATTCCTT TCTTGGAAGA TGCTCAAAAA GAACATGATG CATTTGCCCA 2 40 

AGCTCTTCGC GATGAAGGAA TTGAGGTTCT CT AC CT AG AA CAACTCGCTG CTGAATCATT 3 00 

GACCTCTCCA GAAATCCGCG ATCAATTTAT CGAGGAATAC TTAGACGAAG CCAACATCCG 3 60 

TGATCGTCAA ACCAAGGTTG CTATTCGTGA ATTGCTTCAC GGCATCAAGG ACAACCAAGA 42 0 

ATTGGTTGAA AAAACAATGG CTGGGATTCA AAAAGTTGAA TTGCCAGAAA TTCCTGACGA 4 80 

AGCTAAAGAT CT AACTG AC T TAGTTGAATC AGAGTATCCA TTTGCAATTG ACCCGATGCC 540 

AAACCTCTAT TTCACTCGCG ACCCATTTGC AACAATTGGA AACGCCGTAT CGCTTAACCA 600 

CATGTTTGCA GACACTCGTA ACCGTGAAAC ACTCTACGGT AAGTATATCT TCAAATACCA 6 60 
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CCCAATCTAT GGCGGAAAAG TGGATTTGGT CTACAACCGT GAAGAAGATA CGCGTATCGA 720 

AGGTGGAGAC GAGTTAGTTC TTTCTAAAGA CGTCCTTGCA GTAGGTATCT CTCAACGTAC 7 80 

AGACGCAGCT TCTATCGAAA AACTTTTGGT CAACATCTTC AAGAAAAATG TTGGCTTCAA 840 

GAAAGTTTTG GCCTTTGAAT TTGCTAACAA CCGTAAATTC ATGCACTTGG ATACTGTCTT 9 00 

CACTATGGTA GACTATGACA AGTTCACTAT TCACCCAGAA ATCGAAGGCG ACCTTCACGT 960 

TTACTCAGTT AC T T ACG AAA ACGAAAAACT TAAAATCGTT GAAGAGAAAG GTGACTTAGC 1020 

TGAACTTCTT GCTCAAAACC TTGGTGTAGA AAAAGTT C AT TTGATTCGTT GCGGTGGTGG 108 0 

CAATATCGTA GCAGCTGCGC GTGAACAATG GAACGACGGT TCTAACACTT TGACCATCGC 1140 

ACCTGGTGTG GTAGTTGTTT ATGACCGCAA TACCGTGACC AATAAGATTT TGGAAGAATA 12 00 

CGGGCTTCGC TTGATTAAGA TTCGCGGAAG TGAATTGGTT CGGGGCCGTG GTGGACCTCG 12 60 

TTGTATGTCT ATGCCATTTG AACGTGAAGA AGTGTAATCG CTGTTCGATA TTCGTCAATA 13 2 0 

GAAAATGTAA AAAATAGAAA GAGGAAATAA T AAAAT G AC A AATTCAGTAT TCCAAGGACG 13 80 

CAGCTTCTTA GCAGAAAAAG ACTTTACCCG TGCAGAGTTA GAATACCTTA TTGGTCTTTC 14 40 

AGCTCACTTG AAAGATTT G A AAAAACGCAA TATTCAACAC CACTACCTTG CTGGCAAGAA 1500 

TATCGCTCTC CTATTTGAAA AAAC AT CT AC TCGTACTCGT GCAGCCTTTA CAACTGCGGC 15 6 0 

T AT CG AC C TT GGTGCTCACC CAGAATACCT CGGAGCAAAT GATATTCAGT TGGGTAAAAA 162 0 

AGAATCTACT GAAGATACTG CTAAAGTATT GGGACGTATG TTTGACGGGA TTGAATTCCG 1680 

CGGATTCAGC CAACGTATGG TTGAAGAATT GGCAGAATTC TCAGGCGTTC CAGTATGGAA 174 0 

CGGTCTAACT GACGAATGGC ACCCAACTCA AATGCTCGCT GACTACTTGA CTGTTCAAGA 18 00 

AAACTTCGGT CGCTTGGAAG GCTTGACATT GGTATACTGT GGTGATGGAC GTAACAACGT 1860 

TGCCAACAGC TTGCTCGTAA CAGGTGCTAT CCTTGGTGTC AATGTTCACA TCTTCTCACC 19 2 0 

AAAAGAACTC TTCCCAGAAA AAGAAAT CGT TGAATTGGCA GAAGGATTTG CTAAAGAAAG 1980 

TGGCGCACAT GTTCTCATCA CTGAAGATGC TGATGAAGCA GTTAAAGATG CAGACGTTCT 2 04 0 

TTACACAGAC GTTTGGGTAT CAATGGGTGA AGAAGACAAA TTCGCAGAAC GTGTAGCTCT 2100 

TCTTAAACCT TACCAAGTCA ATATGGACTT AGTTAAAAAA GCAGGCAATG AAAACTTGAT 2160 

CTTCCTACAC TGCTTGCCAG CATTCCACGA TACTCACACT GTTTATGGTA AAGACGTTGC 2220 

TG AAAAATT T GGTGTAGAAG AAATGGAAGT AACAGACGAA GTCTTCCGCA GCAAGTACGC 2280 

TCGCCACTTC GATCAAGCAG AAAACCGTAT GCACACTATC AAAGCTGTTA TGGCTGCTAC 2340 

ACTTGGTAAC CTTTATATTC C T AAAGT AT A ATTTTAGATA AT AAAC CGT C TACCAACAGC 2400 



WO 98/18931 



PCT/US97/19588 



558 

TATGAGGGCT GCGACTAATA GCTTTAGTCC GGTCCTCTTT TATGTAATGG TAATCTATTA 24 60 

TTTCTTATAA AATATGTGAA AAATCATTAA ATTGAAATCT AAACGCATTC TATTGAGTGT 2 52 0 

GATAAAGGAG AATTTATGGC AAATCGTAAA ATTGTAGTAG CTTTGGGAGG AAATGCGATT 2 58 0 

CTTTCTTCTG ACCCATCAGC AAAGGCTCAA CAAGAAGCTT TAGTTGAAAC AG CT AAG CAT 2 640 

CTTGTAAAAT TGATTAAAAA TGG AG AT GAT CTG ATT AT C A CTCACGGTAA TGGACCTCAA 27 00 

GTTGGGAATC TCTTGCTCCA AC AT TTGGC A TCAGACTCTG AAAAGAACCC TGCCTTCCCA 2 7 60 

CTCGACTCAC TTGTCGCTAT GACAGAAGGT AGCATCGGTT TCTGGTTGAA AAATGCTTTG 2 82 0 

CAAAATGCTC TCTTGGATGA AGGCATCGAA AAAAATGTTG CCTCTGTTGT AACGCAAGTT 2 880 

GTCGTAGATA AAAATGATCC AGCTTTTGTT AACTTGAGTA AACCAATCGG TCCTTTCTAT 2940 

T C AG AAG AAG AAGCAAAAGC AG AAG C C G AA AAAAG CGG AG CGACTTTCAA GGAAGATGCT 3 000 

GGCCGTGGCT GGCGTAAGGT CGTTGCCTCA CCAAAACCTG TTGACATCAA AGAAATTGAA 306 0 

ACCATCCGTA CTCTTTTAAA TAATGGTCAA GTCGTCGTAG CTGCAGGTGG TGGCGGTATT 312 0 

CCCGTCGTCA AAGAAAACAA TGGACATTTG ACTGGTGTCG AAGCGGTTAT TGATAAAGAC 3180 

TTCGCTTCCC AACGTTTGGC AGAATTGGTT G AT GC AG AC C TCTTCATCGT TTTGACAGGT 3 24 0 

GTAGATTATG TATTTGTTAA C T AC AAC AAG CCAAACCAGG AAAAATTGGA ACATGTGAAT 33 0 0 

GTTGCCCAGC TGGAAGAATA TATCAAACAA GATCAGTTTG CACCAGGTAG CATGCTTCCA 3 3 60 

AAAGTAGAAG CAGCTATCGC TTTTGTCAAT GGTCGTCCAG AAGG AAAAG C AGTTATTACT 3 42 0 

TCCCTTGAAA ATCTAGGCGC CTTGATTGAA TCTGAAAGCG G AACAAT T AT TGAAAAAGGA 3 4 80 

TAAGTTGTTT TACTAATAAG ATGTATTCTA TTTCTAGTAT CTTTATATCA AATTAGAAAT 3 54 0 

TATTCTTGAA AACATGTACA ATATTTCAAA AG AT ACT AG T T TT AG AC T T T AATATGGTAA 3 6 00 

AACAAATATA AATAGAAAGC GTTTTCTTGA ATGTTTATTT AAGAAAGTAG TTGGTTTTTT 3 6 60 

ACACTTTGTT AGACATCAGG AGGAAAAACA AATGAGTGAA AAAGCTAAAA AAGGGTTTAA 3 720 

GATGCCTTCA TCTTACACCG TATTATTGAT AATCATTGCT ATTATGGCAG TGCTAACTTG 3 7 80 

GTTTATCCCT GCGGGGGCCT TTATAGAAGG TATTTACGAG ACTCAGCCTC AAAATCCACA 3 84 0 

AGGGATTTGG GATGTCCTCA TGGCACCGAT TCGGGCT AT G CTAGGTACTC ATCCAGAGGA 3 9Q0 

AGGTTCGCTC ATTAAAGAAA CGAGCGCAGC GATTGATGTA GCCTTCTTCA TCCTTATGGT 3960 

TGGTGGTTTC C TTGGC AT TG TCAACAAAAC TGGTGCTCTT GACGTAGGGA TTGCCTCTAT 4020 

CGTGAAGAAG TATAAGGGCC GCGAAAAAAT GTTAATTTTG GTACTGATGC CTTTGTTTGC 4080 

CCTCGGTGGT ACAACTTATG GTATGGGTGA AGAAACAATG GCCTTCTATC CACTCCTTGT 414 0 

GCCAGTTATG ATGGCCGTTG GTTTTGATAG CCTGACTGGT GTTGCAATTA TTTTGCTCGG 42 00 
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TTCTCAAATC GGCTGTTTGG CATCTACTCT GAATCCATTT GCGACAGGTA TTGCTTCAGC 42 60 

GACTGCGGGA GT TGGT AC AG GGGACGGTAT CGTACTTCGT CTGATCTTCT GGGTTACCTT 43 2 0 

GACTGCTCTT AGTACTTGGT TTGTTTACCG TTATGCGGAT AAGATTCAAA AAGATCCGAC 4380 

TAAGTCACTG GTTTATAGTA CTCGCAAAGA AG ATT TG AAA CACTTTAACG TAGAAGAATC 444 0 

TTCATCTGTA GAATCTACAC TTAGCAGCAA ACAAAAATCA GTTCTCTTCT TATTTGTGTT 4500 

GACATTCATC TTGATGGTAT TGAGCTTCAT TCCATGGACA GACCTTGGCG TTACCATTTT 45 60 

TGATGACTTT AATACTTGGT TGACTGGTCT TCCAGTTATT GGTAATATTG TCGGTTCATC 4620 

TACTTCTGCA CTAGGTACTT GGTACTTCCC AGAAGGCGCA ATGCTCTTTG CCTTTATGGG 46 80 

TATCCTGATT GGTGTTATTT ATGGTCTTAA AGAAGATAAG ATTATCTCTT CCTTCATGAA 4740 

TGGTGCTGCT GACTTGCTCA GTGTTGCCTT GATCGTAGCG ATTGCTCGTG GTATTCAAGT 4800 

TATCATGAAC GACGGTATGA TTACCGATAC AATCCTCAAC TGGGGTAAAG AAGGCTTGAG 4 8 60 

CGGTCTATCT TCACAAGTCT TTATCGTTGT AACTTATATC TTCTATCTAC C TATG T CAT T 4 92 0 

CTTGATCCCA TCTTCATCTG GTCTTGCCAG CGCAACTATG GGTATCATGG CTCCACTTGG 4980 

AGAATTTGTA AATGTCCGTC CTAGCTTGAT TATCACTGCT TACCAATCTG CTTCAGGTGT 5040 

CTTGAACTTG ATTGCACCAA CATCTGGTAT TGTGATGGGA GCTCTTGCAC TTGGACGTAT 5100 

CAACATTGGT ACTTGGTGGA AATTCATGGG CAAACTCGTA GTCGCTATTA TTGTAGTGAC 5160 

CATCGCCCTT CTTCTCCTTG GAACCTTCCT TCCATTCCTA TAAAATAGTG AGTGAGGTGA 52 2 0 

T T CC AT G AAA ATAGATATAA CAAATCAAGT TAAAGATGAA TTTCTTATAT CATTAAAAAC 52 80 

CTTGATTTCC TATCCTTCAG TACTCAATGA AGGAGAAAAT GGAACACCTT TTGGACAAGC 534 0 

AATCCAAGAT GTCCTAGAAA AAACT T TAG A GATTTGTCGA GACATAGGTT TCACTACCTA 54 0 0 

TCTTGACCCT AAAGGTTATT ACGGATATGC AGAAATCGGT CAGGGAGCAG AGCTTCTGGC 54 6 0 

CATTCTCTGT CATTTGGATG TTGTTCCATC AGGTGATGAA GCAGATTGGC AGACACCGCC 552 0 

ATTTGAAGCA ACTATCAAAG ACGGCTGGGT ATTCGGACGT GGTGTCCAAG ATGATAAAGG 5580 

CCCTTCGCTC GCAGCTCTCT ATGCAGTAAA AAGCTTGCTG GACCAAGGTA TTCAGTTCAA 564 0 

AAAGCGCGTA CGCTTTATCT TTGGTACCGA TGAGGAAACC CTCTGGCGCT GCATGGCACG 57 00 

CTACAATACC AT C G AAGAAC AGGCCAGTAT GGGCTTTGCA CCTGACTCAT CTTTTCCTCT 5760 

GACCTATGCT GAAAAAGGGC TTCTACAGGT CAAACTTCAT GGCCCTGGAT CGGATCAACT 5820 

AGAGCTTGAA GTAGGAGGCG CCTTTAACGT TGTACCAGAC AAGGCCAACT ACCAAGGTCT 5880 

CCTCTATGAA CAGGTTTGTA ACGGTCTCAA AGAAGCTGGT TAT G ATT AC C AAACCACTGA 594 0 
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ACAAACCGTA ACGGTTCTCG GAGTGCCAAA GCATGCTAAG GATGCTAGTC AAGGTATCAA 6000 

TGCTGTCATC CG ACT AGC T A CCATTCTTGC TCCTCTCCAA GAACACCCTG CTCTCAGTTT 6060 

TCTTGCAACA CAAGCAGGTC AAGACGGCAC AGGAAGACAA ATCTTTGGTG ATATAGCAGA 612 0 

TGAACCTTCT GGTCACCTAT CCTTTAATGT CGCAGGTCTC ATGATCAATC ATGAACGTTC 6180 

TGAAATCCGT ATTGACATTC GGACTCCTGT CTTAGCTGAC AAGGAAGAAC TAGTAGAGTT 624 0 

GCTTACAAGA TGTGCACAAA ACTACCAACT CCGCTACGAA GAGTTTGACT ATCTAGCGCC 63 00 

TCTATACGTC GCAGAAGACA GTAAACTCGT TAGCACACTG ATGCAAATCT ACCAAGAAAA 63 6 0 

GACTGGCGAT AACAGTCCTG CTATTTCATC CGGTGGTGCC ACTTTTGCTC GCACCATGCC 642 0 

AAATTGTGTA GCCTTCGGCG CCTTATTCCC AGGAGCGAAG CAGACAGAAC ATCAGGCAAA 64 8 0 

TGAATGTGCC GTTCTAGAAG ATTTGTACCG TGCTATGGAT ATTTATGCCG AAGCCGTCTA 654 0 

TCGACTTGCA ACT T AATC AG GCAACTGTTT CTACCAAAAA AAATCGACCG ATTAATGAAC 6600 

TGCACCCCAA AAGTTAGACA GAATAAATCT AACTTTTGGG GTGTTTTATT ATGAAATTGA 66 60 

GTTATGAAGA TAAAGTTCAG ATCTATGAAC TAAGAAAGCA AG G AC AAAG C TTCAAACAGC 672 0 

TTTCAAAAAG ATTTGGTGTG GATGTTTCTG GTCTAAAGTC ATCTGAATCT TTGAGATGAG 67 80 

CTTTATAAAT CGCTTTTTTC AGTTTTTGCA CTGGTGTTTC GATAAACTCA AACTTTTTAG 684 0 

CCGTGGTATT GCCTGATTTT ATAGTATATT GAAACTAGAA TAGTACACCT CTCCTTCTAA 69 00 

AACATTTTTA GAAATCGATT TGACTGTCCT GATCGATTTG TCCTGTTCTT ATTTCATTTT 696 0 

AC TAT ATT TG AGCCACTTCG TCTTTAACGG CTTTATTCAT AAGCTCTTGT AATTTTTCTT 7 02 0 

TACTATCAAT TACTTCTGAT TTTCCGTTGT AATTTATTGT AATAGGTTTT AACTTACCTA 7 08 0 

ATTTCTCGAC ACGCTCATTA ATTTGATCTT TTTTGAAGGC TGCTTATGTT TTTCCTAAGA 714 0 

TTTTTTCAAA AATATATTTA TCAGATAGCG GTTTGTCTTC TTCTTCAGCT TGGTTTTTGT 7200 

ATTAATTTGA AACATAAGGA ACAAATCCTT CATAGTAACC TAATGCTCCC ATAAGTTCAA 72 60 

AAGCTTGTTT TCTAATTCAA ACCATTGCAA CTCAGATTTC AGCTTTTCAG ATAAATCCTG 7320 

CT C AT C C AAA TAATGACTTG AAATTAGTGC TGAACTCGTT TCTGTATCCT GTACAGGCTG 73 8 0 

AG C ACC C AT A CCAGCAAAAA ATAAACTCGT TCCTAGCAAG ACCGAACAAG CTCCTATTGC 7440 

ATATGGCCTC AAAGAAAAAC GCTGCTTTCT CTCAAATTGA AATTCTTTCA TCCCATCTCC 7 500 

CATCATTCAT TATTACTGTA TATTTTGTAT ATCAGAAATA GTTTGTATTC ACAAATCTTT 7 5 60 

CTAGTTATTC CCTTATCATT CCTAATTAAG GGAGATAACA TACAATAATT TTTAGTTAAA 7 620 

TGTATATCGA TGTTTTTTGT TTTTCTTAAT AAACGCAATA C AAAAAG AG C CTGTTACCAA 7 680 

GCTCTTTGTA CT C AATG AAA AT C AAAG AGC AAATTAGGAA AC TAG C C AC A GGTTGCTCAA 7740 
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AACACCGTTT TGAGGTTGCA G AT AGAAC T G ACGAAgTCAG CTCAAAACAC TGTTTTGAGG 7 800 

TTGCAGATAG AACTGACGAA GTCAGTAACA TCTATACGGC AAGGCGACGC TGACGTGGTT 7 8 60 

TGAAGAGATT TTCGAAGAGT ATTAGTCTAT TATTTCTTCT CAGCGCGAAG GGCTGACAAG 7 92 0 

ATTTGTGTTC GGATATCATC CACACCATTT GGAGTATTTG GTAAAAAGAT AGTTTGATTT 7 980 

CCTTTAGAGG CAAAGGTATT CAAGGTATCC AAATACTGGT TGGTCAAGAG GATAGACATG 804 0 

ATTTGTTCTT CTGTCATGCC AACATTGGCT TCCTTGAGTT CGGTGATAGA CTCTGCCAAT 8100 

CCATCCACAA TCGCCTTACG TTGTTGGGCA ATCCCCACAC CATGAAGGCG GTCTTTTTCT 8160 

GCTTCTGCTT CAGCTGCAGT GACAATTTTA ATCTTGTCAG CTTCCGCCAA TTCTTGTGCT 8220 

GCGACCCGCT TACGTTGCGC CGCATTGATT TCATTCATGG ATTGCTTAAC TTCTGCATCT 82 80 

GGTTCGACCT TGGTAATCAA GGTTTTCACG ATAATGTAGC CGTAAGTGGT CATTTCTTCT 83 4 0 

GCTACTTGGT GTTGAACTTC AAGGGCAATC TCATCTTTTT TCTCAAACAA TTCATCCAAG 84 00 

GTTAATTTTG GAACAGAAGA GCGAAGAGCA TCTTCGATAT AAGATTTAAT CTGAGATTCT 84 60 

GGACGTATGA GTTTATAGTA AGCATCTGTC ACGCTCTGCT CGTTGACACG GTACTGAGTC 852 0 

GCTACATTCA TCATAACGAA CACATTGTCC TTGGTCTTAG TCTCAACCAC AAT AT C ACT T 8580 

TGCAACAAGC GC AAC TG AAT CCGTGCTGCA ATCGAGTCAA TCCCAAAAGG CAAGCGAATA 864 0 

TGAATACCGC TATTAGCAAC CTTTTGGTAT TTCCCAAAGC GT TC AAT AAT CGCCACCGAC 87 00 

TGCTGACGAA C C AC AT AAAC TGT ACT C AGT GTG AC T AT C A CC AAT AGG AG CACACAAACA 87 60 

AT C AG AAAAA T CAT GAAAAA TATTGCCATA AT GG AAC C T C CACAAGTATT TTTCTAGTAT 882 0 

TATAGCACAT TTAAAGAAGG CTGTGCCGTT TTTACTGCGA TTTTTCCTGA AATGTCAATA 8880 

ATTAGAGGTG AATTGTCCTA TTGTCGTCCA ATCTCTTGCT AAAATAACTC TTTATAAAAG 8940 

GCAATCGTTT CTTCTAAGGT TGG CATAAAT GGATTTCCTG GTGCGCAGGC ATCAATCAAG 9000 

GC ATT CT T AG AAAGGTATTC AAAGTCGAAA TCTTTTTCTT CAATACCAAG T T C AGT C AGT 9060 

T T C T T AGG AA TACCTACTGT CTCAGAAAGC TTCTCAATCT CAGCAATCGC ATAATCGGCA 912 0 

CATTCTTGAT CTGATTTACC TTCTACATGA AGTCCCAAGG CTTTGGCAAC ATTGCGGAAA 9180 

GCTTCTGGTA CACGTTTAGC ATTTTCACGT T CT AT AACTG GTAGCAACAT GG C AC AG C AC 9240 

ACGG 9244 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GATCTGAACT TTATCATCAT AACTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 60 

TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTGGACT GACGTTTTTT TGTATGCTTA 12 0 

TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTGGTGCG ACTGCTTTTC CAAAGAAGGC 180 

TGCTAAGACA AGAATTGTCA AAACATAAGG TGCAATTTGA AG AT AAAC CG CTGGCACTCC 24 0 

TTGTAGGAAC GGCAATTGAG AACCGATAAC AGCCAAACTT TGTGAAAGTC CAAAGAAGAG 3 00 

ACTAGAAAGC AT AG C AC CG A TTGGATTCCA TTTCCCAAAG ATCATCGCAG CAAGGGCGAT 3 60 

AAATCCAGGT CCAACAATAG TTGTCACTGA GAAGTTAACT GAGATTGATT GCGCATAAAT 42 0 

CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTGTA 480 

GACGTTGATT CCCAAGGTAT CCGCTGCTTG AGGATGTTCA CCGACAGAGC GGAGACGAAG 540 

ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAAGGCAA TCGCCAGATA 600 

ACCAAGTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC ACTGGGATAT TTGCCAAGAC 6 60 

TGGGAAATCA AAGCGTCCAA AAGTTTGACT TAGGTTGTCG GTTTGTCCTT TGTTATAAAG 72 0 

AACTTTAACT AAGAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCG TACCGCTGAC 780 

AACATGGTCT GCACGGAAAT GAACCGTCGC TGCTGCGTGG ATGATAGAGA AAAC AC T AC C 84 0 

AACCAATCCT GCTACAAGCA AGGATAGCCA TGGAGTTGCT GCTCCAAATT GTTCTGCAAA 9 00 

TTCAAGGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC ATAATTCCTT CAAGGCCAAC 96 0 

GT T T AC C AC A CCACCACGTT CAGAGAAAAC ACCACCGATA CTTGTAAAGA TGAGAGGTGC 102 0 

TGAGTAAATC AGCATAGAAG ACACCAAGAG GGGGAGCAAG GTTATAATAG ACATCTTTAC 1080 

TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCGTT CGATAAGGTA ATGAACACTG 114 0 

ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CAGATGGTAC CTGCGCCGCA 12 00 

TTCATACCAG GAGCCCCAAC TTGGAGAACG CCAAATAGGA AGGCTGCAAA GAGTATACCA 12 60 

ATTGGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCGT TAAATCCGAT AGCTAATGAC 13 2 0 

GAACCTTGAA CATAGACGTT CTGGAAGGTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 1380 

GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC G CTTGG C AG A AAT AC C AG C A 1440 

TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 1500 

TTGAGCATGA ACCAAATAAC TGCAACGGCA ATGATGGCAA AG AAAAT AC C AATATTCATC 15 60 

CGTGAGTTAC CAGTCAACTC AGCCAACCAA GGTGTCTGAT AGGTTGCATT AGCCCCAACA 1620 
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CGAATGGTCG AATCTGTACT TTGCATGAAG TCTTTAGGGA AAGCATGGAT AAAGGCATTC 1680 

CCTACATACA AGACAATGTA GTTCATCATG ATGGTTACAA TAACCTCTGA CGTCCCTAGA 1740 

TAGGCCCTAA GAATACCTGG AATCGCTCCG ACAATCCCAC CAGCAATCAA GGCAATCACG 1800 

ATGGTTGCTA GAATCATCAA GGGACGGGGC ATATCTGGAT GCGACAGGGC AAACCAACCA 1860 

CTGAGAATCC AACCTGCCAA AGCCTGACCA GGAAGTCCGA CGTTAAAGAA ACCAGCTCGA 192 0 

CTGGCAACGG CAAAACCAAG ACCAATCAAG ACCAGAGGAC CCATAGCACG GAAGATTTCT 1980 

CCAATCCCAC GCAGACTGCC AAAGGCTGTA TAGAACAATT CTTCGTAGCC CCAAATAGCA 2 04 0 

TCATAACCGA AGATCCACAT G AC AATGGC T CCGAGTAAAA TTCCTAGGAA TACAGAAATC 2100 

AAGGGAACCG AAATTTGTTG TAATTTTTTA GACATCACTC TTCTCCTTTC CCAAGTTTCC 2160 

ACCAGCCATC AAGACACCAA GTTCTTGTTT ATTGGTTGTT TCTGGTGATA CAATACCTTG 222 0 

AATCTTACCA TCGTGGATAA CGGCAATACG GTCTGAGACG TTTAAAATCT CATCCAATTC 22 80 

AAAGCTGACA ACAAGGACAG CCTTGCCATT ATCACGCTCT TCAATCAAGC GTTTGTGGAT 2340 

ATACTCAATG G C AC CG AC AT CCAACCCACG AGTTGGCTGG CTAACGATAA GGAGATCAGG 24 00 

ATCTCGATCA ATTTCACGAG CAATAATTGC TTTTTGTTGA TTTCCTCCTG AGAGTGCAGC 2 460 

TGCAGGAACT AATTCACTGG C AG CG C G AAC ATCAAACTCT TCCATCAGCT TTTTAGCATA 252 0 

AGAAGTAATA TTTGAATAAT TCAAAATTCC ATTTTTACTA TGTGGTTCTT TATAGTAGGT 2 5 80 

TTGAAGGGCA ATATTTTCAG ATATCATCAT TTCCAAAATC AAGCCATCAC GGTGACGGTC 2 640 

TTCTGGAACG TGCCCAACAC TTAGTTCTGT AATCTGACGT GGGTGCAAGC CTACAATTGA 2 7 00 

ATCTCCTTTT AGCTCAATGC TACCAGATTC AACCTTACGA AGACCTGTAA TGGCTTGAAT 2 7 60 

CAGTTCAGAC TGACCATTTC CATCAATCCC CGCAATACCA ACAATCTCTC CAGCACGAAC 2 820 

ATCCAAGGAC AGATTTTTAA CAGCTGGAAC ACCACGGTTT T CAT T G AC C A CCAAATCTTT 2 880 

GATAGACAAA ACCACTTCTT TTGGTTTAGA GGCTTGCTTC TCTGTTTTAA AGGAAACAGA 2 940 

ACGTCCTACC ATCATTTCCG CCAAATCAGC ATTGGTAGCC CCTGCAATTT CAACGGTTTC 3 000 

AATTGATTTC CCACGACGGA TAACTGTAAC ACGGTCAGAA ACTGCTCGAA TTTCATCCAA 3060 

TTTGTGGGTA ATCAAGATAA TTGATTTTCC TTCTTTGACA AGATTTTTCA TAATAGCCAT 312 0 

CAACTCATCA ATTTCTGATG GAGTCAAAAC AGCCGTTGGT TCGTCAAAGA TAAGGATATC 3180 

AGCCCCCCGA TAAAGTGTTT TTAAAATTTC TACACGTTGT TGGGCTCCAA CTGAG AT AT C 3240 

TGCTACCTTG GCAGAAGGGT CAACAGCTAA GCCATAACGT T C AG AAAG AG CCTTGATTTC 3 3 00 

TTTGCTAGCT CCAGCGATAT CTAGCACACC ATTTTTAGTC AATTC AC T AC CTAAAATGAT 33 60 
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GTTTTCAGCC ACTGTGAAGG CTTCAACCAA CATAAAGTGC TGGTGAACCA TCCCGATTCC 3420 

CAAGCTAGCT GCTTTAGATG GGGAGTCGAG ATTGACAACT TGACCGTTGA CCGCGATTTC 34 80 

ACCACTAGTT GGTTCAAGAA GGCCTGCTAA CATGTTCATT AGCGTGGACT TACCAGCCCC 3 54 0 

ATTTTCTCCT AAAAGTG CAT GAATTTCACC TTTTCGTAGG TGCAAGTTGA TTTTGTCGTT 3 60 0 

GGCAACAAAT CCACCAAACA CCTTGGTAAT ATCACGCATC TCAATGACAT TTTCGTGTGC 3 660 

CATGTGCTCT TCCTTTCAGA GTCTTATTTT ATTTCAATAA AACTTGCTAG TTTGTCTAGT 372 0 

AGCAAGCTTT AC TT AG AC AA AATGACTTTG TCTCAACTCT TAAAAAAGCG GCCCTTGGCC 37 80 

GCTTCCTAAG AAATGACTTC CATCCATTAT TTTTCAGGAA CTTTTACGCT TCCATCAAGG 3 840 

ATTTTAGCTT TTGCATCTTC GACAGCTTTT TTACCTTCTT CTGAAAGGTT TGTTACTGCC 3 900 

AAGTCAACCC CTTTATCCTT CAATGAGTAA ACGATCACTT G AC CGCC AG G GAATTCTCCT 3960 

CTTTCTGCCT TGTTAGAAAT ATCTTTTACA GTTGTACCAA CTTGTTTCAA AGTAGATACA 4 02 0 

AGAACAAAGT TTGATTCTTT GCCATCTTTA GAAGTGTATT TACCTTCTGC TTCTTGGTCA 4 080 

CGATCAACAC CGATAACCCA AACTTTTTCA TTTTCAGGAC GGCTTTCGTT GAGAGATTTT 4140 

GCCTCTGCAA AGACACCTGC ACCTGTACCA CCAGCTACTT GGTAAACAAT ATCTGCACCG 42 00 

GCTGCGTATT GTGCGGCTGC AATTGTTTTA CCTTTAGCCG CAT C AC C AAA TGAACCAGCG 42 60 

TAGTCAACTT GGACTTTGAT AGATGGGTCT ACTGACGCAA CACCAGCCTT GAATCCTGCT 432 0 

TCAAAACGAG AG AT AAC T T C AGATTCGATA CCACCTACAA AACCAACTTG TTTTGTCTTA 43 80 

GTTGTTTTTG CTGCAGCCAC ACCTGCAAGg TAACCTGACT C ATT AT C AG C GAAAGTTACG 4440 

CTCGCAACAT TCTTTTGGTC TTTAATCACA TCATCAATCA AG AC AT AGT T CAAGTCAGTG 4500 

TGTTCTTTTG CTGCATCTTT AACTGCATTA TTAAGGGCAA AACCAACACC GAAGATTAGG 4 5 60 

TTGTAACTTC CAGCCGCTTG TTGCAAGTTG TTAGCGTAGT CAGCTTCACT TGTTGATTGG 4 62 0 

AAGTAAGTGA AACCGTTATC TTTTGAAAGA TTGTGTTCTT TACCCCAAGC CTGCAAACCT 4 680 

TCCCAAGCTG ATTGGTTGAA TGATTTGTCA TCAACACCAC C AGT AT C AGT GACGATTGCT 474 0 

GCTTTTGTCT T C AC ATC AG A AGATGAAGCT GCGTTACGAG AAGAGCGGTT AC C AC ATG C A 4 800 

GCAAGTCCAA CTGCTGCCAC TGCAACTAGG CCAAGACCTA GCCATTGTTT CTTGTTCATT 4860 

ACTGAACCTC CTAAATAAGA TGTGCAACGA TGTTGCAAGT ATGGATTGGT TGGCCACAAG 4920 

GACCGTGCCA C T C AG AG AG C G ACT C AG ACT AGTTTAAGTC TGTAAAAGAG TATGGAAGTA 4 980 

ATTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC GACTAAGGTC ACTTTTAGAT 5040 

CTTGTTCAAA AAATTCAGCC ATCACTTGGC GACAAGCACC ACATGGCGAG ATCGGTTTTT 5100 

CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG GCCTTCAGAT ATAGCCTTAA 516 0 
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AAATAGCTGT TCTCTCACCG CAATTGGTCA AAGGATAGCT AGCATTTTCA ATATTCACTC 5220 

CCGTGTAAAC ACTTCCGTCT TTAGCTACTA AAACTGCTCC GATAGGAAAG TGAGAATAGG 52 80 

GGACATAGGC ATGTTTGCTG GTTTCAATTG CCAGTTCAAT CAACTCAGTA GTCGCCATCT 5340 

GCCAATTCTC CTTTTAAAAT AGCTACCCCA GCTGACGTTC CGATACGGGT CGCACCTGCT 5400 

TCGACAAAGG C AAG AG CAT C TGCATAAGAA CGAGCTCCAC CGGCGGCCTT GACACCCATA 546 0 

TC AG AT C C AA CTGTTTCACG CATTAATGTA ACATCTGCTA TCGTAGCACC ACCAGTTGAA 552 0 

AAGCCAGTAG ATGTTTTGAC AAAGTCAGCC CCAGCTTTTT GGGCCAATTG GCAAACAACA 5580 

ACTTTTTCTT GGTCTGTCAG AAGGCAAGCT TCAATAATGA CTTTCACTAA CTTATCACCA 5640 

CTTGCTTCCA CTACTGCGCG AATATCTGAC TCAACCAAGG CTAAATTACC TGATTTGAGA 5700 

GCTCCAACAT TGATCACCAT ATCAATCTCA TCTGCACCAT TTTGGATAGC TTCTTTTGTC 57 60 

TCAAATGCTT TCACGGCTGA AGTTGTTGCT CCCAAAGGGA AACCTACTAC TGTGCAAACC 582 0 

TTAACATCTG TGCCTTCAAG TCCTTTTTTA GCATGTTCAA CCCAGGTCGG ATTAACGCAA 5 88 0 

AC AC TGGC AA AGTCATACTC TCTAGCCTCA GACAACAAAC TATCAATTTG TTTTTTCTTT 5940 

GCATCTTGTT TTAAAAGCGT ATGATCTATA TATTTATTTA ATT T C ATT T C GGTTTTCCCT 6000 

CCATTTAGGA GATGATTTCT ACAATTTCAC GGATTTTTTT CACTTCATCA CTTATTTTAA 60 60 

CACATTTTTG GAAATCTGTA ACTAGTTGAG GTGGAATTTT TTCATTTGTG TATACTTTTG 612 0 

CAACAATTTC ACCCTTTTGA ACGGAGTCTC CAATCTTCTT TTCAAAAACA ATTCCTGTTT 6180 

CATAGTCCAA GGCATCAGAC TTAACTGCAC GACCAGCACC CAGCCTCATG GCATAAAGAC 62 40 

CAAAGTCCAT AGCTGGAAGA GCTGAAATGA CACCCGTTTC CTGAGCAGGG ATTTCCACCA 6300 

CATGAGCTAC ATTTACAGGA CGATAGAGGT CTTCCAAGTC TCCACCTTGG GCTTGCACCA 63 60 

TTTCCTCAAA CTTAGCCAGT GCT TG AC CAT TCTCAAGATG TTGGTGAACT TCTTCAACAG 6420 

TTTTGTTAAC ATTTGCCAAA CCAAGCATAA TTTGAGCCAA TTCACAAATA AAGTGGGTAA 648 0 

TATCCTGACG TCCTTGACCT TGCAAAATCT CCAATGCTTC AAGGATTTCC AGACGATTTC 6540 

CAATCGCTCG TCCCAAAGGC TGGCTCATAT CCGTAATCAC TGCTACTGTC TTCCGTCCAA 6600 

CAACCTTACC AAGATCTACC ATAGTTTGAG CCAACTCACG CGCCTCATCA ACCGTCTTCA 6660 

TGAAGGCACC CTCACCGACA GTCACGTCTA GCAAAATAGC ATCCGCCCCT GCCGCAATTT 6720 

TCTTGCTCAT CACCGAACTC GCAATCAAAG GAATCGTGTC GACAGTTGCG GTC AC AT C AC 67 80 

GAAGGGCATA GAGAAGCTTA TCTGCTTTGA CCAGCTGGTC TGATTGCCCA ATGACAGATA 684 0 

CTCCAATATC CTGAACCTGA CGAATAAAAT CCTCTTGACT ACGTTCTACT TGATAGCCCT 6900 
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TAATGGACTC CAATTTATCA ATTGTTCCGC CTGTATGGCC AAGACCACGA CCACTCATTT 6960 

TTGCTACAGG CACACCGAAG CTAGCAACAA GAGGAGCTAA AATCAAGGTT ACCTTATCGC 7 02 0 

CGACACCACC AGTAGAATGC TTGTCAACTT TCACACCATC AATGGCTGAC AGGTCAAACT 7 080 

CTTGCCCAGT CTTAACCATA TTCATCGTTA AAT C AG AG AT TTCTCGAGTC GTCATTCCTT 714 0 

TAAAATAAAC AGCCATAGCA AAGGCAGACA TCTGATAATC AGGAACAGTT CCTGATACAT 7200 

AGCCTTCTAT CAGCCATTCA ATTTCACTTG AAGTCAGTTC TTGACCGTCT CGTTTTTTTT 72 60 

GGATTAAATC AACTGCTCTC ATTCTTTCAC ACTTCTAAGG ATATAGTATC CCTTGTCTTT 73 2 0 

TTTAAGGATT T C AC AATTGC CAAACACATC TTCCATCTTA GACTTGGCAC TTGGAGCTCC 73 80 

TTGTTTTTTC TGGATGACGA TGGTCAAATC TCCACCAATT TCCAAGAAAT CTTTACTTTT 744 0 

CTCGATGATT TCATGAACGA CTTGCTTGCC CGC AC GG AT A GGAGGATTGG AAATGACATG 7 5 00 

GTCAAATCGC CCTTGAACTC TTGCATAAAT AT TAG ATT G A AATATCGTCG CTTTTGCATT 7 560 

ATTTTTTTCA GCATTTCTCT GAGCTAAATC CAGGGCACGA GTGTTAATAT CAACCATGGT 7 62 0 

CGCCTGAACT CCGTAAACCT TGACCAAGGA CAAACCTAAT GG AC C AT AAC CACAGCCTAC 7680 

ATCTAGGACT GTCTCTCCTT GGTTGACATC C AG AC ACTT G AGCAAGAGTT GACTTCCAAA 7 740 

GTCAACCATT TTCTTGCTAA AAACACCCGC ATCTGTCAAA AAAGTCATTT TTTCTCCCAA 7 800 

CAAGTCCACT CTCAACTCAT GAATGTCGTG AGCAGCGTCA GGATTTTCTG CATAGTACAT 7 8 60 

TTTACTCATG ACACTATTTT AC C AT AATT T GACTCAAATT GTAAATCGTT TACAAATTGA 7 92 0 

TAATAAAACG AAAAAGACCG AAGAAAGCAA GTCACGAAGC CATTTTCTTC AATCTCTTTC 7 9 80 

AACACTTATA AATAATAAAC CATTTAGAAC TATAAATATC ACAGTCCAGA TAAAAACAAA 8 04 0 

AAGTTTATCA TCTATAATCA GGCAGATTAT TATTTCTATT GCTTAACCTT AAAATACTTT 8100 

ATTATCAACA AAATTCCTAA CAAAATGTTT AG AT AAAAG C CCAACTGATA CGTTTATGTC 8160 

AGGATTTCCA AACTTGTCCA AAGTCGTATC AAATCTTCTA GTGACATGTG GAAGAAATAA 8220 

CCCTCTGTCG CAATCCGTAG GACTAAAAAG C AAT AAC T AC CCGCAGCAAT CCATTTCGTC 82 80 

CATCGTTTTT TAGTAAGAAA GCAATTAAGA ACGAACAAAT AAAG AC AG CT GTTACAATAG 8340 

CATGTTCCAT CAAAAAAGTA AAACCGTAAT AGGTTTCCAC AAAGCATCTA C C ATT AT C T G 840 0 

CATTGGTTCC TTTTATAAAA GGTAAAGCAA AACTTAAAAT AAAAC AG AG T TCCAATATGT 84 60 

AACGTTTTAA GATTTTCATA GTACACCTCC TATAAGTTGT GAACTAAAAA GCCCCCTTTA 8 520 

TAAGCTTATA AAT C AGT AG A ATCTATCTCC TATTTCATCA ATAAATTGAT CACTTATACT 85 80 

ATATACCATT GACTTACCAC ATT C AAG AAA CCGCTTTATT TTTTTAGCTT TTTATGGTAT 864 0 

GATAGACAAA ATATCTAGGG GAAAACAAAT GACCAACGAA TTTTTACATT TTGAAAAAAT 8700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAAGACAACA CCTCCTTTGA CAGAAGAAGA 



8760 



ATTGGAATCT ATCAAGAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 



8820 



TCTCCCCTTG GCTCATTTGA TTCAGATTTA CAAGCGAACT AAGGAAGATT TAGCCTTTTC 



8880 



AAAAGGAATT TTCCTCCA 



8898 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TATCTTAACG aGGATTGGGT TTATCGTCAG TCTTATTGCC CTAATTGTGG GAACAATCCC 60 

TT AAAT CAT T TTGAAAATAA TCGGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 12 0 

GAGTTTGAAC TAAAGAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTGCTTAT 180 

GCAACGATGA TGAAGCGTGT GCAGGCAGAT AATAATCCTA ATTTCTTTTT TTTAACTTAC 2 40 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AGCAATTTGT TACACCGAAA 3 00 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGGTTG GATTGGTTGT 3 60 

AACATTGATT TATCACAAGT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA AG AT GG AC AA 42 0 

GTTAGAGATC CAGAAAAAGT TACAAAAGAA TTTAAGCAAG GTTTATTTTT AAGGAAGAGC 4 80 

TCTCTGTCAT CAAGAGGTTG GACAATAGAA ATTCTAAATT GTATAGATAA GATAGAGGGT 54 0 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 6 00 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 660 

ATAATAGAAT TTAAAGGTAG AGGAAAGTAT CGGAAATTAT GAAAACGAAA CAACTTGT^G 72 0 

CATCAGAAGA GGTGTATGAT TTCTTAAAAG TCATCTGGCC TGATTATGAA ACTGAAAGCC 7 80 

GTTACGATAA CCTAAGTTTA ATCGTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGGT 84 0 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 900 

GGTGGGAAGT AGGAGATAAA TTGCCAGAGT GGCTACATAG CTCCTATCAT AGATTATTGT 960 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA GTATACAGTA GAAATTACAG 102 0 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTGAAAATCC AGATGAAGCC GAACGACTTG 1080 

TAAGAGAAAA GTATAAGAGT TGTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT ATATGAATAG GTAGATGTTT TTATTTTGTC AACAAAAAAG AGGCTCGCAC 1200 

CTCTTTTTCT TATTTCTTTT TATGATTTAA TACGGCATTG AGGACAATAG CGAGTAGGCT 12 6 0 

GGCTACGACG ATTCCGTTTG AGAAGAACAT TTGGAAGGCT GTCGGCATGC TGACAAAGAG 1320 

ATTACTGTTG TTGAGACCGA CACCTGCAGC GATTGAAACA GCTGCGATAA GGAAGTTGTG 13 80 

TTCATTGTTA GCAAAGTCAA CACGGGCGAG GATTTGCATC CCTTGAATTG ATACAAAACC 1440 

AAACATTACC AGCATGGCAC CACCGAGGAC GGAGCTTGGA ATGATTTGGG CAAGGGCGCC 15 00 

AAACTTAGGA AGCAGTCCAA GGAGAACCAG GAAACCAGCT GCGTAGTAGA TTGGCAGGCG 15 60 

TTTTTTGATG CCTGACAATT TAACCAAACC AACGTTTTGT GAAAATCCGG TGTAAGGGAA 162 0 

GGTGTTAAAG ATTCCTCCGA GAAGTACGGC CAAACCTTCT GCGCGGTATC CGTTGCGAAG 168 0 

GCGCGTGCTG TCGATTGGAT CCTTTGTGAT ATCAGACAAG GCCAGATAAA CACCAGTTGA 1740 

CTCAACCATA GACACCGTTG CGATGATACA CATCATGACA ATAGATGAGA TTTCAAAGGT 1800 

TGGCATCCCA AAGTAGAGTG GAGTTGGGAC ATGGACAAGT GGAGCTACCG CAACAGGAGA 18 60 

GAAGTCCACC AAGCCCATAG TAGCAGCAAT GGCAGTTCCA ACAACCAGAC CAATCAAAAT 192 0 

AGAGATAGAC TTGATAAATC CTTTGGTAAA GATGTTGATC AAGAGGATAA TCAGAACAGT 1980 

AATAGCTGCA AGCAAGAGAC TTTGACCAGT TGGCTCTGGA ACGTTATTTC CCATATTTCC 204 0 

AATAGCGACA GGGATCAAGG TTAAACCAAT CGTGGTAATA ACAGATCCTG TT AC GAT AG A 2100 

TGGGAAGAGA TTGGCTACTT TTGAGAAGAT GCCTGAAACA AGAACCACGT AAATCCCAGA 2160 

TGCGATAAGG GCACCAAACA TAGCGCCACT ACCATGGCTT TGCCCAATCA TAATCAAGGG 2220 

AGCGACCGAC TGGAATGCAA CTCCAAGAAC GACTGGGAGT CCAATCCCAA AGTATTTGTT 22 80 

GAGTTGGAGT TGGAGGAAGG TTGCCACCCC ACACATGAAG AT AT CT GT AG AAATCAGGTA 2 340 

GGTCAACTGC TCAGCTGAAT AGCCAAGGGC TGTCGCAATC ATGATGGGAA CCAGGATAGA 24 00 

TCCTGAGTAC ATGGCTAGTA AGTGCTGCAA GCCAAGAACG GCTGCTTGCG AGTGTTTTTC 2460 

TTGAGTTTGC AT TAG AG AT C TGCCTCCTTA AATACGACTT GACCATTTTC AAAACAATCC 2 520 

AAACGAGCAA GTGATAGGAC AGGGTAGCCT GCTTTTTCAA GCAAATCACG ACCATCTTGG 2 5 80 

AAGGATTTCT CAATCACGAT ACCGATAGCT TGGACTGTGG CACCGGCCTG TT CG AT GAT T 2 640 

TGAATCAAGC CTTTAGCAGC TTGGCCATTA GCAAGGAAAT CGTCGATAAT CAAAACCTTG 27 00 

TCCTCTGGTG AGAGGAATTT TTCAGCGATA GAAACGGTGC TGGTCACCTG CTTGGTAAAG 27 60 

GAGTAGACTT GAGCAGTTAA GATGCCTTCG TTCATGGTGA TGTTCTTAGC TTTTTTGGCG 2 820 

AAAATCATGG GAACGTTTAA GGCTTCAGCT GTAAAAACGG CTGGGGCAAT ACCCGACGCT 2 880 

TCAATGGTTA CGACCTTGGT AATGCCAGTA GT AG C AAATT TTTCCGCAAA AACCTTACCA 2 94 0 



WO 98/18931 



PCT/US97/19588 



569 

ATCTCTCGCA TCAAGCTAAA GTCAACTTGG TGGGTTAAAA AGGAATCTAC CTTGAGGATG 3 000 

T TAT C AC CCA AGATATGCCC ATCCTTGAGG ATGCGCTCTT CTAATAATTT CATAAGACCT 3 0 60 

CCTAAAGTCT AAAAGTTAAT TTACTTGTTG TTTAAATATT TCTATAGTGA TCCCTTTTGC 3120 

TAATACTATA TATTTGATAA AACTATTACG AGCGAAGCGA GTCTTATCAA ATATTTCCCG 318 0 

TTGTAGTGGT ATCATAGACA ATAATCTTGT TATTGTCTAT GACGGGATTT TTGAGAGTAA 3240 

AATAGTTCGG GGAACTATTT TAGCCTAAGC CTAGAAATGA AAGAGCTAGG GGCTCAAAAA 3 3 00 

TTAGGGATGA AATTCCCTGG ATTCCTGAAA TT ATT C AC AG GATAATTTCA CCTCCCGTCC 3 3 60 

GCACTAATTA AGGGAAATAT TAAAAAAAGA CCTACTTAAT CTCTAAGTAA GTCCCCTAAA 342 0 

TAGACATGGC AAAAACGGCC ATATCTCACT GCTGACTTAC TTATTGTTAG GTGTTCCGGC 3480 

AC CT TGT AG A AACGTCGTGC CAATTCACGA CATAAACAAG T AAAAC GAT A TTCAATTTTA 3 540 

AATAGGCTTG AGCCAATGTT TTTATTTTAC ACTAAATAAC TTTAGAAATC AACTATTTTG 3 600 

TTAGTGTTTT GGTTTAAAAA ACGAACAAAA AGAAGAGAGG GTGAACAAAA ACTCCATTGT 3 6 60 

AAGCTAACAG T TAT ACT AAA TGAAAATCAA AGAGCAAACT AGGAAGCTAT CCACAACCTC 3720 

AAAACACTGT TTTGAGGTTG TGGATAGAAT TGACAGAGCC AGTATCATAT ACCTACGGTA 3780 

AGGCGACGTT GACGTGGCTT GAAGAGATTT TCGAAGAGTA TTAGAAGATT TTTCCATCAT 3840 

AAAAGGCATA CTATCAAGCT TTTAGACACC TGACAATATG CCTTTTTCTA ACTTTAAAGA 3 900 

CTTTTCCCAA TTTTTATTAT TCTACTCGCT AAATCTTAAA AAATAGCCAT CTGGATCCAA 3 9 60 

AACTGCAAAT TTATGAGGAT AGATATAGGG ATCACTGACA CGAAACTTTC TTTTGGTCAA 402 0 

GGGACGATAA ATAGGATAGT TTGCCTTCAT CACTCTTTAA TAGAGTTTTG AAACATCCTT 4080 

TATGCCAAAG GAGAGATTGA CTCCACGACC AAAGGGATAG GTCAGTTCAG CTAGTTGATC 4140 

CTTTGTTCCC TCCTCTAACA TTAGTTGACA CTCTTCAAGA GAAAGAGAAA GTTTTCTTCT 42 00 

GGACGTTGGT ATTCAATCCT AAAACCCAGT AAACCACAGT AGAAGGACCG GGACTGTTCG 42 60 

ATATTCGATA CAAGCAACTC GGGAATGACC GCATTGTAGT CCATATAGAA AATCCTTACA 432 0 

AGTCAATTTC CAAGACAATC GGTGTATGGT CTTGGCGAGC ACCTGAGTCA ATCATATCAG 43 8 0 

ATTTAGTGAC CTTGTCAGCG ATACGGTTAC TTGTGAGCCA GTAGTCGATT CTCCAGCCTG 4440 

TATTGTTGAT TTTAGAAGTT TTGCTGCGTT GTGCCCACCA AGTGTAGCGT TCAGGAACAT 4 500 

CGCCATGAAC ATGGCGGAAG GTGTCTGTAA ATCCAGTTGC CAAAAGGTTG GTAAATCCAG 4560 

CACGTTCCTC GTCAGTAAAT CCAGGTGAAC GGCGGTTGCT AGCAGGATTT GCAAGGTCGA 462 0 

TTTCATTGTG GGCTACGTTG TAGTCACCGG TCGCAAGGAC TGGTTTTTCT TTGTCTAGTT 4680 
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CAGCCAAATA CTCAGCATAT TTGGCATCCC AGACTTGGCG TTCTTCCAAG CGTTTGAGAC 4 7 40 

CGTCACCAGC GTTTGGAGTG TAAACTTGGG TTACGAAAAA TGCATCAAAT TCTAGAGTGA 4800 

TGATACGACC TTCCAAGTCC ATGGTAGAAG GGGCACCGAT TTCTGGGAAG CTGATAGTAG 48 60 

GTGTAAGTTC TTTCTTATAA AGGAACATGG TTCCAGCATA GCCTTTACGG GCAGGCTCTT 4 920 

GGGAAGAGCG CCACGTGTTT TCGTAGCCTG GGAAGAGTTC TTCTAAAATT TCCACGTGTT 4980 

TCTTTGTAGG TCCTTTGGCA GAAAGCTTGG TTTCTTGGAT AGCAATGATA TCAGCATTTT 5040 

CAGCGACCAA GGTTTGTAGG ACTTCTTGGG ACAATTTGGC ACGAGCTGAG TCACTAGTTA 5100 

GGGCAGCGTT TAGGGAATCA ATATTCCATG AGATAAGTTT CATAAAGTTA CCTTTTTCAT 5160 

TCAGATTATA GATTTTATTA TACCAAAAAA AGATCTATTT CCCCAACGTA TGGTTTGAAA 52 2 0 

AATTACTCTC TTTCGTTTAT AATTAAGAAT GATTTTATGA AAGGGAGTGA AAATACATGA 52 80 

AATTCTACTC TTATGACTAT GTACTCAGCC AAATCGGTCA GCAAAATGGT ATCATGGTTG 534 0 

GCTTTGGGAT TGTTCTATTA G C TGTG AC AG TTTTTTTTGC TTTCAAGGCA TACCATAATA 54 00 

AAAAGGGAAG CGAATTTCGT GAGTTGGTCA TGATTTCAGA TCTGGCCTTA TTTAGCTCTG 54 6 0 

CTTTTGGTCA GCATCACGAC TTATCAAAAC AATCAAGTTT CTAACAATAA ATTTCAAACT 552 0 

TCACTTCATT TCATCGAGGT TGTTTCCAAA GATTTGTGAG TAGACAAGTC AGAAGTCTAT 5580 

GTTAATACTT CCACAAACAC AGATGGCGCA CTTATCAAGG TGGGAGATCG CTATTATCGT 5 64 0 

GCCCTAAATG GAAGTGAGCC AGACAAGTAC CTGTTAGAGA AAGTCGAATT GTATAAGACA 57 00 

GACGCAATTG AACTGGTGGA TGTGAACAAA TGACACTTAA TTATATCGAA ATTTTAATCA 57 60 

AACTGGTCTT G ACT CTC AAA TAGCTCAACA ACAATGTTCA CTTTGTGAAA CGTTTGATTG 5 820 

ATGGTAAGCC AACTCTCCTT AT C AAAAATG GGAATATTGA CCCAGAAGCC TGTCGTTCAG 5880 

TTGGTTTGTC TGCATCGGAT GTATCCCTCA AACTTCGTAG CCAAGGGATT TTCCAGATGA 5 9 40 

AGCAAGTCAA ACGAGCTGTG CAAGAGCAAA ATGGGCAACT CATCGTTGTG CAAATGGGAG 6 000 

ATGAAAATCC TAAGTATCCA GTTGTGACTG ACGGTGTGAT T C AAGT AG AT GTCTTGGAAT 60 60 

CGATTGGTCG TAGCGAAGAG TGGTTGCTTG ATAACCTCAG TAAACAAGGG CATGACAATG 612 0 

TAGCCAATAT CTTTATTGCT GAATATGACA AGGGTGCTGT T AC AG T C GT A ACTTATGAAT 6180 

AAGAAAAACC TGGGGTCTTG TACTCTTCGA AAATCTCTTC AAACCGCGTC AACGTCGCCT 6240 

TGCCGTATGT AGGTTACTGA CTTCGTCAGT TCTATCTACA AC CTC AAAGC AGTGCTTTGA 63 00 

GCAGCCTGCG GCTAGTTTCC TAGTTTGCTC TTTGATTTTC ATTGAGTATT GGCCTCAGGT 63 60 

TTCCATTTGC AATCAGAAAG GGATTTTATG TCCATTATTC AAAAACTTTG GTGGTTTTTC 64 2 0 

AAGTTAGAAA AACGCCGTTA TCTAGTCGGA ATTGTGGCCC TGATCTTGGT TTCCGTCCTC 6480 
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AATCTCATTC CTCCTATGGT TATGGGGCGG GTCATTGATG CCATCACATC GGGGCAATTA 6 54 0 

ACCCAGCAGG ACCTCCTTCT TAGCCTATTT TACTTGCTAC TTGCAGCCTT TGGTATGTAC 6600 

TATTTGCGCT ATGTGTGGCG T AT GT AT AT C CTTGGGACCT CTTATTGCTT GG G AC AG AT C 6 6 60 

ATGCGGTCTC GCTTGTTTAA GCATTTCACA AAAATGTCGT CAGCCTTTTA TCAAACCTAT 672 0 

CGGACGGGTG ATCTGATGGC ACACGCAACC AAT GAT AT C A ATGCCTTGAC TCGTTTAGCA 67 80 

GGTGGCGGTG TCATGTCTGC GGTGGATGCC TCTATCACGG CTCTGGTGAC TTTGTTGACC 684 0 

ATGCTCTTTA GCATCTCATG GCAGATGACT CTTGTTGCCA TTCTCCCCCT ACCTTTCATG 6900 

GCCTATACGA CTAGTCGCCT AGGGAGAAAG ACTCATAAGG CCTTTGGCGA ATCCCAAGCT 6960 

GCTTTTTCTG AAC T C AAT AA CAAGGTACAG GAGTCCGTAT CAGGTATCAA AGTGACCAAG 702 0 

TCTTTCGGTT ATCAGGCAGA CGAGTTGAAG TCTTTTCAGG CAGTCAATGA ATTAACCTTC 7 080 

CAAAAGAACC TGCAAACCAT GAAATATGAT AGTCTCTTTG ACCCTATGGT TCTCTTGTTT 7140 

GTTGGTTCGT CCTATGTTTT AACGCTTTTG GTTGGCTCCT TGATGGTTCA GGAAGGGCAG 7 2 00 

ATTACAGTTG GG AAT CT AG T CACCTTTATC AGCTATTTGG ATATGCTGGT CTGGCCTCTT 72 60 

CTGGCCATCG GTTTCCTCTT T AAT ACT AC T CAGCGAGGGA AGGTTTCTTA CCAGCGGATT 732 0 

GAAAATCTTT TGTCTCAGGA ATCTCCTGTA CAAGACCCTG AGTTTCCTCT GGATGGTATT 73 80 

GAAAATGGGC GTTTGGAGTA TGCCATTGAC AGCTTTGCTT TTGAAAATGA GGAAACACTG 74 4 0 

ACGGATATTC ACTTTAGTTT GGCAAAAGGG CAAACACTGG GCTTGGTTGG GCAGACAGGC 7 5 00 

TCTGGGAAAA CGTCCTTAAT CAAGCTCCTC TTGCGTGAAT ACGATGTGGA TAAGGGTGCC 7560 

ATTTATCTAA ACGGTCACGA TATTCGGGAC TATCGTCTGA CAGACCTTCG CAGTCTCATG 7 62 0 

GGCTATGTTC CTCAGGACCA GTTTCTTTTT GCGACTTCAA TCCTAGACAA TATCCGCTTT 7 680 

GGCAATCCTA ACTTGCCCCT TTCAGCGGTC GAGGAAGCTA CTAAGCTAGC CCGGGTTTAC 7 74 0 

CAAGATATTG TAGACATGCC TCAAGGATTT GATACGCTGA TTGGTGAAAA AGGAGTCACT 7 800 

CTTTCTGGTG GTCAAAAGCA ACGGTTGGCT ATGAGTCGGG CTATGATTTT AGACCCTGAT 7 860 

ATCTTGATTT TGGATGATTC CTTATCCGCC GTAGATGCCA AGACAGAGTA TGCGATTATC 7 92 0 

GACAACCTCA AGGAGATGCG AAAGGACAAG ACAACCATTA TCACTGCCCA TCGCCTCAGT 7 9 80 

GCTGTTGTCC ATGCAGATTT TATTTTAGTT CTACAAAATG GTCAAATTAT CGAACGAGGC 8 040 

ACGCACGAAG ACTTGCTAGC TTTGGATGGC TGGTATGCCC AAACCTACCA GTCTCAGCAG 8100 

TTGGAAATGA AAGGAGAAGA AGATGCAGAA TAAACAAGAA CAATGGACTG TATTGAAGCG 8160 

CTTGATGTCT TATCTCAAGC CTTATGGACT CCTGACCTTT TTGGCACTCA GTTTTCTCCT 8220 



WO 98/18931 



PCT/US97/19588 



572 

AGCGACGACG GTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 82 80 

TCTCAGCAAT CTTAACCAAC TAG C C GT T AC CGTTTTGCTG GTCTACTATG GT C T CT AC AT 8340 

C CT AC AAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTGGGCA TGTCTTACTT 84 60 

TGACAAGACG CC AG C AG GT T CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 852 0 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8580 

CCTTTATACC ATGTTGGTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8 640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCG AAAAAAGTCA GTGAAAATCA TCGAGAAAAC 87 00 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA G CTGGC AG AG AATATCGAGG GAATCAGGAT 87 60 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGGCAGAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTTTTTGA GACCTGCCAT 88 80 

GAGTTTGCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT ACCGTGGTTT 8 940 

TTCTATCGGG ATAACGGTCG GGACCATGTA TGCCTTTATC CAGTACATCA ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGGCTA TGGTTTCTGC 9 060 

AGGTCGTGTC TTTGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 912 0 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATGTG TGTTTCTCAT ATGACGGTAA 9180 

ACATCCGATT CTGGATGACA TTTCTTTCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9240 

AGGTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATGCGCT TTTATGAATT 9 3 00 

CCAGTCAGGG AGAGTTCTCT TGGATGATGT GGATATCAGG GATTTCAGTC AAGAAGAGCT 93 60 

GAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9 42 0 

CAATATCGCC ATGTACCAAG AAACCAGTGA TGAGCAGGTT CAGGCTGCGG CAGCCTTTGT 94 80 

GG AT G C AG AT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGAGCG 9 540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC TTTGCTAGAA CAGTCGCCAG 9 600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC GACAGCCAAT ATTGACTCTG AAACAGAAAG 96 60 

CTTGGTTCAA GCTTCTCTGG C G AAG AT GAG ACAGGGCCGA ACAACTATTG CTATCGCTCA 972 0 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATC TAT GTC TTGGATAAGG G AC GC AT TAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTGGGAGGA ACCTATCACA AGATGTATAG 9840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9 9 60 

T T AC AG ATTT CTTTCACCGC CTTTTCCATT T TGTGGT AT A ATGAAAAATG TTGACAAATA 10020 
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GTATAATAAA AACAAAGGAG AACAGCATGC TGAAATGGGA AGACTTGCCT GTGGAAATGA 1008 0 

AATCAAGCGA GGTTGAGTCT TACT AC C AG C TTGTCTCTAA AAGGAAGGGT TCGCTGATTT 1014 0 

TCAAGCGTTG CTTGGACTGG GTTTTGGCCT TGGTCTTACT GGTTCTGACC TCTCCCATCT 10200 

TTCTCATCTT GAGCATTTGG ATCAAGTTGG ATAGCAAAGG GCCAGTGATT TACAAGCAAG 10260 

AGCGTGTGAC CCAGTACAAC CGTCGGTTCA AGATTTGGAA GTTTCGTACC ATGGTGACGG 10320 

ATGCGGATAA AAAAGGAAGT CTGGTGACTT CTGCTAACGA TAGCCGCATT ACCAAGGTTG 10380 

GAAATTTCAT CCGACGTGTC CGTTTGGACG AACTGCCTCA GTTGGTCAAT GTCCTTAAAG 10440 

GTGAGATGTC CTTTGTCGGT ACACGACCTG AAGTGCCACG TTATACAGAG CAGTATAGCC 10500 

CTGAAATGAT GGCAACCTTG CTCTTGCAAG CAGGGATTAC CTCTCCAGCC AGCATCAACT 10560 

AC AAGG AT G A GGACACAATT AT C AGT C AAA TGACGGAGAA AGGTCTGTCA GTTGATCAGG 10620 

CCTATGTGGA GCATGTTCTT C CTG AAAAG A TGCGCTATAA CCTCGCCTAT CT C CG AG AGT 10680 

TTAGTTTCTT TGGGGACATC AAAATCATGT TTCAAACCGT GTTTGAGGTA CTAAAATAAA 10740 

GTAGTCATAA GAAAATGAGT ACAGATAAAA GGAGCAAATC AATGCCAAAT TACAATATTC 10800 

CATTTTCACC GC CTG AT AT C ACAGAAGCAG AAATTACTGA AGTAGTGGAT ACCCTGCGTT 108 6 0 

CTGGTTGGAT CACAACAGGT CCTAAAACAA AAGAACTGGA GCGCCGCTTG TCTCTTTACA 1092 0 

CACAGACACC TAAGACTGTT TGTCTCAACT CTGCGACAGC CGCTCTGGAG TTGATTTTAC 109 80 

GCGTTTTGGA AGTGGGACCT GGTGATGAAG TCATCGTTCC AGCCATGACC TATACGGCTT 11040 

CATGTAGTGT CATTACGCAC GTGGGAGCAA CCCCTGTCAT GGTGGATATC CAAGCAGATA 11100 

CGTTTGAGAT GGACTATGAC CTGCTTGAGC AAGCTATCAC TGAGAAAACT AAGGTGATTA 11160 

TTCCAGTAGA GCTCGCAGGG ATTGTTTGCG ATTATGACCG TTTGTTCCAA GTCGTGGAGA 112 2 0 

AAAAACGTGA CTTCTTTACC GCTTCAAGCA AGTGGCAAAA GGCCTTTAAC CGTATTGTCA 112 80 

TTGTCTCTGA T AGTGC C C AC GCTTTGGGAT CTATTTATAA AGGACAACCT TCTGGTTCTA 1134 0 

TCGCTGACTT TACTTCCTTC TCATTCCATG CAGTTAAGAA CTTTACAACG GCAGAAGGTG 11400 

GAAGTGCGAC TTGGAAAGCC AATCCAGTGA TTGATGACGA AGAGATGTAC AAGGAATTCC 11460 

AAATCCTTTC CCTTCACGGG CAAACTAAGG ATGCTCTTGC CAAGATGCAA CTGGGGTCAT 11520 

GGGAATACGA TATCGTTACA CCAGCCTATA AGTGCAACAT GACCGATATC ATGGCTTCAC 11580 

TTGGTTTGGT ACAATTGGAC CGCTATCCAA GTTTGTTGCA ACGCCGTAAG GACATTGTGG 11640 

ACCGCTATGA TAGTGGTTTT GCAGGTTCTC GCATCCATCC TTTGGCACAC AAGACTGAAA 11700 

CTGTCGAATC TTCACGCCAC CTCTACATCA CCCGTGTAGA AGGAGCAAGC CTAGAAGAAC 117 60 
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GCAACCTCAT CATCCAAGAA TTGGCTAAAG CAGGAATTGC AAGTAATGTT CACTACAAAC 11820 

CGCTTCCTCT CTTGACAGCC TATAAGAATC TTGGATTTGA TATGACGAAC TATCCTAAGG 11880 

CCTATGCCTT CTTTGAGAAT GAAATTACCC TCCCTCTTCA TACTAAATTA AGCGATGAAG 11940 

AAGTAGACTA TATCATTGAG ACTTTCAAAA CAGTTTCTGA AAAAGTGCTA ACTTTATCAA 12 000 

AAAAATGACA AACTACAGTC AAGCGAAAGT GATCCTGCCC CTAAAAAGTC TAATTGAGTG 12 0 60 

TAAAAACTGT TGTTTTCAAT TGATAATAGT TTACACCTGT AGTTGAGGCC CCTTTCTCCT 1212 0 

CAGAGAGAGA ATTTTTATAG GATTTTCCTT TCTTGTGGGA GTCCCGTGGT TTGAAATAAG 1218 0 

ATGTGAGCAA TTTAGTGTAG CATTTAGAAT CCTTACTAGA CATCATTTAG AAAATCTAGT 12240 

GTCTTGTTCT AGTTTTCAAT TCACCCTATT TTTTGAAAGA CGTGAGTTTC CATGAGTGAG 12 3 00 

ATTGTGGAAA CTCGCGTCTT TTTTTGTTTT CAGAATATTG TTCAAAATTT TGTGCCTGTC 123 60 

TTTCATGTTC TAGTCATTCT TTTGCATGAT AGAATTTATA GCATGTTGAT ATTATAATAA 124 2 0 

TACAAATATT CTATATGTTT AGTGATGCTT GC TAT AC AT T AT T AG ATCT C CTGCGAGACA 124 80 

AT CT AT AAAA CACTTGTCTA CG ATT AC CT A TATGCCCTAT TCCAGTATTT TAGAAGCACT 12 540 

GCATCTATTT TTATCGAGGT TAAATCTAGC T T TT AT AG AA GGTCTATTTA AGAAATATAT 12 6 00 

TGTAGTGTTT TAGTTTCAAT CCGCCATATG AG CG AT AT T C AGGTAAATAT CCCTGGCGAA 12 6 60 

TGCTTGTATG ACAAGGTATT TGTTCTTTCA TTTATAATTT AC AAC AT AT C AACAAATTTA 12 72 0 

AATATAGTAA ATGGGATATT T T AT ATT C AA GCTAAGAAAG ATAGCATCAC TTTTGAATGG 12 7 80 

AAGGCTAAAG AGC AAAC TAG GAAGTTGGCC AT AG AT AG CT CAAAACCCTG CTTTGAGGTT 12 84 0 

GTAGATATAG TAAAATGAAA TGAGAATAGG ACAAATTGAT CGGG AC AG T C AAATCG AT T T 12 900 

CTAACAATGT TTTAGAAGTA GAGGTGTACT ATTTTAGTTT CAGTCTACTA TAGAACTGAC 12 9 60 

CAAGTCAGTA ACCTAGACTT AGGGCAAGGC GGCACTGACC TAGTTTGAAG AGATTTCCGA 13 020 

AGAGTATAAA TTTTAATATT TTCTTGTGTT ATTCCTTGAC AATTCAATTT G G AAAAT AT A 13 080 

T G AT AAAG AT AATGAC AG CG GTGTCATTCT ATCTATTTTA AGAAAAGTAA TAATCAATTG 1314 0 

T T AAAAAT AG TAAAAAAATT GGAGGTTCTG AT G AAAT AT T TTGTTCCG 13188 

(2) INFORMATION FOR SEQ ID NO : 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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AACGAGTGCA TCAGTCTCAG CAAGCACCAG TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 
TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 
AACAAGTGCA TCGGCTTCAG CAAGCACAAG TGCTTCAGCC TCAGCAAGTA TCTCAGCGTC 18 0 

TGAATCGGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT AGCGCCTCAG CAT C AGCGT C 240 
AACAAGTGCT TCGGCTTCAG CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 300 
AGCCTCAGCA AGCACATCAG CTTCTGAATC TGCATCAACC AGTGCGTCAG CCTCAGCATC 360 
GACAAGCGCC TCAGCTTCAG C AAGT AC C AG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 420 
AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 4 80 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 
AGCAAGTACT AGTG C AT C AG CATCAGCATC AACGAGTGCA TCGGCTTCAG CAAGTACCAG 600 
CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTCAGCCTC 6 60 

AGCAAGCACC AGTGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 72 0 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACGAGTG CATCAGCTTC 7 80 

AGCATCAACA AGTGCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 84 0 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AG CAT C AG CG TCAACAAGTG CTTCGGCTTC 900 
AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG C AAGC AC AT C 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 102 0 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 108 0 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAg CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 12 00 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 12 60 

CACCAGTGCG TCAGnCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 13 2 0 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 13 80 

TACCTCAGCG T C TGAATC AG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 15 00 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 15 60 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 16 8 0 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 174 0 
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GACAAGTGCT 


TCGGCTTCAG 


CATCAACGAG 


TGCGTCGGCC 


TCAGCAAGCG 


CAAGTACCTC 


1800 


AGCGTCAGc t 


TCCGCCTCAA 


CCAGTGCGTC 


GGCTTCAGCA 


AGCACAAGTG 


CGTCAGCCTC 


1860 


AGCAAGTATC 


TCAGCGTCTG 


AATCGGCATC 


AACGAGTGCG 


TCTGAGTCAG 


CATCAACGAG 


1920 


TACGTCAGCC 


TCAGCAAGCA 


CATCAGCTTC 


TGAATCTGCA 


TCAACCAGTG 


CGTCAGCCTC 


1980 


AGCATCGACA 


AGCGCCTCAG 


CTTCAGCAAG 


TACCAGTGCT 


TCAGCCTCAG 


CGTCGACAAG 


2040 


TGCGTCGGCC 


TCAACCAGTG 


CATCTGAATC 


GGCATCAACC 


AGTGCGTCAG 


CCTCAGCAAG 


2100 


TACTAGTGCA 


TCAGCTTCAG 


CATCAACGAG 


TGCATCGGCT 


TCAGCATCAA 


CCAGTGCCTC 


2160 


GGCTTCAGCG 


TCAACCAGTG 


CGTCAGCTTC 


AGCAAGTACC 


AGTGCTTCAG 


TCTCAGCATC 


2220 


AACAAGTGCT 


TCAGCCTCAG 


CATCGACAAG 


TGCCTCGGCT 


TCAGCAAGCA 


CAT C AG CAT C 


2280 


TGAATCAGCG 


TCAACCAGTG 


CTTCGGCTTC 


AGCAAGTACC 


AGTGCTTCAG 


CTTCAGCATC 


2340 


AACCAGCGCC 


TCGGCCTCAG 


CAAGCACCTC 


AGCTTCTGAA 


TCGGCCTCAA 


CCAGCGCCTC 


2400 


GGCCTCAGCA 


AG C AC CTC AG 


CTTCTGAATC 


GGCCTCAACC 


AGCGCCTCAG 


CCTCAGCATC 


2460 


AACGAGTGCT 


TCGGCTTCAG 


CAAGCACAAG 


CGCCTCGGGT 


TCAGCATCAA 


CGAGTACGTC 


2520 


AGCTTCAGCG 


TCAACCAGTG 


CTTCAGCCTC 


AGCATCAACA 


AGTGCGTCAG 


CCTCAGCAAG 


2580 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


TGCGTCTGAG 


TCAGCATCAA 


CGAGTACGTC 


2640 


AGCCTCAGCA 


AGCACCTCAG 


CTTCTGAATC 


GGCCTCAACC 


AGTGCGTCAG 


CCTCAGCATC 


2700 


GACAAGCGCC 


TCAGCTTCAG 


CAAGTACCAG 


TGCTTCAGCC 


TCAGCGTCGA 


CAAGTGCGTC 


2760 


GGCCTCAACC 


AGTGCATCTG 


AATCGGCATC 


AACCAGTGCG 


TCAGCCTCAG 


CAAGTACTAG 


2820 


TGCATCGGCT 


TCAGCATCAA 


CCAGTGCCTC 


GGCTTCAGCG 


TCAACCAGTG 


CGTCAGCTTC 


2880 


AGCAAGTACC 


AGTGCTTCAG 


TCTCAGCATC 


AACAAGTGCT 


TCAGCCTCAG 


CATCGACAAG 


2940 


TGCCTCGGCT 


TCAGCAAGCA 


CATC AG CATC 


TGAATCAGCG 


TCGACAAGCG 


CCTCAGCTTC 


3000 


AGCAAGTACC 


AGTGCGTCAG 


CCTCAGCGTC 


GACAAGTGCG 


TCAGCCTCAG 


CAAGTACTAG 


3060 


TGCATCAGCT 


TCAGCATCAA 


CGAGTGCATC 


GGCTTCGGCG 


TCAACCAGTG 


CATCAGAGTC 


3120 


AGCAAGTACC 


AGTGCGTCAg 


CTTCCGCATC 


AACAAGTGCC 


TCGGCTTCAG 


CAAGCACCAG 


3180 


TGCGTCGGCT 


TCAGCAAGTA 


CTAGCGCCTC 


AGCCTCAGCC 


TCAACCAGTG 


CGTCAGCCTC 


3240 


AGCAAGTATC 


TCAGCGTCTG 


AATCGGCATC 


AACGAGTGCG 


TCCGCTTCAG 


CAAGTACTAG 


3300 


CGCCTCAGCC 


TCAGCGTCAA 


CAAGTGCATC 


GGCTTCAGCG 


TCAACGAGTG 


CGTCTGAATC 


3360 


GGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCCTCAG 


CGTCAACAAG 


3420 


TGCATCGGCT 


TCAGCATCAA 


CGAGTGCGTC 


CGCTTCAGCA 


AGTACTAGCG 


CCTCAGCCTC 


3480 


AGCGTCAACA 


AGTGCATCGG 


CTTCAGCGTC 


AACGAGTGCG 


TCTGAGTCAG 


CATCAACGAG 


3540 
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TGCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 3 600 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 3660 

TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG CGTCAGCCTC 3 72 0 

AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC 3 780 

AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 3 840 

AACCAGTGCA TCAGAGTCAG C AAGT AC C AG TGCGTCAGCT TCCGCATCAA CAAGTGCCTC 3 900 

GGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCTTCAG CTTCCGCGTC 3960 

AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CAAGTGCCTC 4 02 0 

GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 4080 

CACCAGCGCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 4140 

TGAATCAGCA TCAACAAGTG CCTCGGCTTC AG C AAG C AC A AGTGCTTCAG CCTCAGCAAG 42 00 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC 42 6 0 

AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC 43 2 0 

AACGAGTACG TCAGCCTCAG C AAGC AC AT C AGCTTCTGAA TCTGCATCAA CCAGTGCGTC 43 80 

AGCCTCAGCA T CG AC AAG CG CCTCAGCTTC AG C AAGT AC C AGTGCGTCAG CCTCAGCAAG 4440 

TACCAGTGCT TCAGCCTCAG CGTCGACAAG TGCGTCGGCC TCAACCAGTG CATCTGAATC 4500 

GGCATCAACC AGTGCGTCAG CCTCAGCAAG TACTAGCGCC TCAGCCTCAG CATCAACGAG 4 5 60 

TGCGTCCGCT TCAGCAAGTA CTAGTGCATC AGCTTCAGCA AGTACTAGCG CCTCAGCCTC 4 620 

AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 4680 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACAAGTG CGTCGGCTTC 474 0 

AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG CAAGTACCAG 4800 

TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG CCTCTGAATC 4860 

CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG CGTCAACGAG 4 920 

TGCGTCTGAG TCAGCATCAA CGAGTGCGTC AGCCTCAGCA AGCACATCAG CTTCTGAATC 4980 

TGCATCAACC AGTGCGTCAG CTTCCGCATC AACAAGCGCC TCGGCCTCAG CAAGTACAAG 5040 

TGCTTCAGCC TCAGCATCAA CCAGTGCATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 5100 

AGCGTCAACC AGTGCCTCGG CTTCAGCAAG TACCAGTGCG TCAGCTTCAG C AAG C AC AAG 5160 

TGCGTCAGCT TCAGCATCAA CCAGTGCTTC GGCTTCGGCA TCAACAAGTG CCTCAGCATC 522 0 

AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TC AG CATC AG CATCAACCAG 52 80 
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TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA TCAACGAGTG CATC AG CATC 5 34 0 

AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA TCAGTCTCAG CAAGCACCAG 5400 

TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 54 60 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 552 0 

TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 55 80 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCmTCAG CATCAACGAG 5 64 0 

TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA AGCACCAGTG CGTCAGCCTC 5700 

AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC TCAGCTTCAG C AAGT AC C AG 57 60 

TGCGTCAGcT CAGCATCAAC AAGTGCTTCA GCTTCGGCCT CAACAAGTGC GTCAGCTTCA 582 0 

GCATCAACGA GTGCGTCGGC TTCAGCAAGC ACCAGTGCCT CGGCCTCAGC AAGCACCAGT 5880 

GCTTCAGCTT CAGCATCAAC AAGTGCGTCA GCTTCAGCAA GTACATCAGT TTCAAATTCA 5 94 0 

GCAAACCATT CGAACTCACA AGTTGGAAAT ACTTCTGGAT CGACAGGTAA ATCCCAAAAA 60 00 

GAATTGCCTA ATACAGGTAC TGAGTCGTCA ATTGGATCTG TGTTACTTGG AGTTCTAGCA 6 0 60 

GCTGTTACAG GT ATT GG AT T GGTTGCGAAA CGCCGTAAAC GTGATGAAGA AGAGTAAGAC 612 0 

AACCTGTAAA GTTAGGCTAA ACTAACTCGC GCACATAAAT CAAGGAGAAA ATTGCTAGTG 6180 

GAT G AT AAAA TAACAGTCAT TGTACCAGTA TACAATGTGG AAAACTATCT GAGGAAGTGC 624 0 

CTAGATAGTA TTATTACTCA AACATATAAA AATATTGAGA TTGTTGTCGT TAATGATGGT 63 00 

TCTACGGATG CTTCAGGTGA AATTTGTAAA GAATTTTCAG AAATGGATCA CCGAATTCTC 63 60 

TATATAGAAC AAGAAAATGC TGGTCTTTCT GCCGCACGAA ACACCGGTCT GAATAATATG 64 2 0 

TCCGGAAATT ATGTGACCTT TGTGGACTCG GATGATTGGA TTGAGCAAGA TTATGTAGAA 64 8 0 

ACTCTATATA AAAAAATAGT AGAGTATCAG GCTGATATTG CAGTTGGTAA T T AT TAT T CT 6540 

T T C AACG AAA GTGAAGGAAT GTTCTACTTT CAT AT AT T G G GAGACTCCTA TTATGAGAAA 6 600 

GTATATGATA ATGTTTCTAT CTTTGAGAAC TTGTATGAAA CTCAAGAAAT GAAGAGTTTT 6660 

GCTTTGATAT CTGCTTGGGG TAAACTCTAT AAGGCAAGAT TGTTTGAGCA GTTGCGCTTT 672 0 

GACATAGGTA AATTAGGAGA AGATGGTTAC CTCAATCAAA AGGT AT ATT T ATTATCAGAA 67 80 

AAGGTAATTT ATTTAAATAA AAGTCTTTAT GCTTATCGGA T T AG AAAAGG TAGTTTATCA 684 0 

AGAGTTTGGA CAGAAAAGTG GATGCACGCT TTAGTTGATG CTATGTCTGA ACGTATTACG 6 9 00 

C T ACT AGCT A ATATGGGTTA T C CT C TAG AG AAACACTTGG CAGTTTATCG T C AG AT GT T G 69 60 

GAAGTCAGTC TCGCCAACGG TCAAGCTAGT GGTTTATCTG ACACAGCAAC GTATAAAGAG 7 02 0 

TTTGAAATGA AACAAAGGCT TTTAAATCAG C TAT CG AG AC AAGAGGAAAG TGAAAAGAAA 7080 
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GCCATTGTCC TCG C AG C AAA CTATGGCTAT GTAGACCAAG TTTTAACGAC AATCAAGTCT 714 0 

ATTTGTTATC ATAATCGTTC GATTCGTTTT TATCTGATTC ATAGCGATTT TCCAAATGAA 72 0 0 

TGGATTAAGC AATTAAATAA GCGCTTAGAG AAGTTTGACT CAGAAATTAT TAATTGTCGG 72 60 

GTAACTTCTG AGCAAATTTC ATGTTATAAA TCGGATATTA GTTACACAGT CTTTTTACGC 73 2 0 

TATTTCATAG CTGATTTCGT GCAAGAAGAC AAGGCCCTCT ACTTGGACTG TGATCTAGTT 73 80 

GTAACGAAAA ATCTGGATGA CTTGTTTGCT ACAGACTTAC AAGATTATCC TTTGGCTGCT 7440 

GTTAGAGATT TTGGGGGCAG AGCTTATTTT GGT C AAG AAA TCTTTAATGC CGGTGTTCTC 7 50 0 

TTGGTAAACA ATGCTTTTTG GAAAAAAGAG AATATGACCC AAAAATTAAT TGATGTAACC 75 60 

AATGAATGGC ATGATAAGGT GGATCAGGCA GATCAGAGCA TCTTGAATAT GCTTTTTGAA 7 62 0 

CATAAATGGT TGGAATTGGA CTTTGATTAT AAT CAT ATT G T CAT T C AT AA ACAGTTTGCT 7 68 0 

GATTATCAAT TGCCTGAGGG TCAGGATTAT CCTGCTATTA TTCACTATCT TTCTCATCGG 774 0 

AAACCGTGGA AAG AT T T GG C GGCCCAAACC TATCGTGAAG TTTGGTGGTA CTATCATGGG 7 8 00 

CTTGAATGGA CAGAATTGGG ACAAAACCAT CATTTACATC CATTACAAAG ATCTCACATC 786 0 

TATCCAATAA AGGAACCTTT CACTTGTCTA ATCTATACTG CCTCAGACCA TATTGAACAA 7 92 0 

ATTGAGACAT TGGTTCAATC CTTGCCTGAT ATTCAGTTTA AGATAGCAGC TAGAGTAATA 7980 

GTTAGTGATC GATTGGCTCA GATGACAATT TATCCAAACG TGACTATATT TAACGGAATT 8040 

CACTATTTGG TAGATGTCGA TAATGAATTG GTAGAAACCA GTCAAGTACT TTTAGATATT 8100 

AATCATGGCG AAAAGACAGA AGAAATTCTC GAT C AAT T TG CTAATCTTGG CAAGCCTATC 816 0 

TTATCCTTTG AAAATACTAA AACCTATGAA GTAGGTCAGG AGGCATATGC TGTTGACCAA 822 0 

GTTCAAGCAA TGATTGAAAA ATTGAGAGAA ATAAGCAAAT GAAGAAAAAT CATTTAGTAG 8280 

GAGATGCTCT GATTTTGACG GTTAGTGATC AGATTGAAGA GTTGGATTAT TTTTTATAAA 8340 

ATTTCTCCGT TC AT CAT AT A TGAAAGTTGT TCAAACATCA GAGTGCTTTA TAAAATATAA 84 0 0 

ATAGACCTAA AGATATTTAA TATGAACTGC ACCCCAAAAG TTAGACAGAA AAAATCTAAC 84 60 

TTTTTGGsGT CAGTACAATA TTAGGGTGTG ATTAATTATC TTTTTAGGTG AAAATGATTC 852 0 

TATATTATAG CTGTTTGATA CGAAATTTAT TATAAGGAAA TTATGTTAAT GAATACAAAA 8580 

TCTATAGTTT TTAATGCAGA T AATG AT TAT GTAGATAAAT TAGAAACTGC AATTAAATCT 8 640 

ATTTGTTGTT ATAATAATTG TTTAAAATTT TATGTATTTA AT GATG AT AT TGCGTCAGAG 8700 

TGGTTTTTGA TGATGAATAA GCGATTGAAG ACTATACAAT CTGAAATCGT TAATGTAAAG 87 60 

ATTGTAGATC ATGTTCTTAA AAAGTTTCAT TTACCGTTAA AGAATTTAAG TTATGCCACT 882 0 
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TTCTTTCGTT ATTTTATACC TAATTTTGTC AAAGAAAGTC GTGCTTTATA CCTAGATTCT 8 880 

GACATCATTG TTACAGGAAG TTTAGACTAT TTATTTGATA TAGAACTAGA TGGTTATGCC 894 0 

TTGGCAGCAG TAGAAGATTC TTTTGGTGAT GTTCCTTCTA CCAATTTTAA CTCCGGAATG 9000 

TTATTAGTTA AT G TAG AT AC T TGG AG AG AT GAAGATGCTT GTTCGAAACT GTTAGAACTG 9060 

ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGGAA TTTTAAATAT GTTATTCCAT 912 0 

GATAGATGGA AAAG AT TAG A CCGAAATTTT AATTTTATGG TGGGGATGGA TAGCGTCGCA 9180 

CACATAGAAG GAAATCATAA ATGGT AT GAG ATTTCTGAGT TGAAAAATGG AGATTTACCT 924 0 

AGTGT TAT AC ATTATACTGG GGTAAAACCT TGGGAAATAA TTTCCAATAA TCGCTTTAGA 9300 

GAAGTTTGGT GGTTTTATAA TCTGTTAGAA TGGTCTGATA TTTTATTGAG AAAAGACATT 93 60 

ATTAGTCGTA GTTTCGAAGA ACTTGTATAC AGTCCTAAAG CTCATACAGC AATTTTTACA 942 0 

GCTAGTTGTG AGATGGAGCA TGTAGAATAT T TG AT AG AAA ATTTACCAGA GGTACATTTT 94 80 

T CT AT ACT AG CACATACATA TTTTGCGTCT AGTGTCGTTG CTTTATTAAG ATATAGCAAT 9540 

GTTACGATTT ATCCTTGTTT TTCTCCATTT GATTATCGAA AAATTTTGGA TAATTTAGAT 9600 

TTTTATTTAG AT AT T AAT C A TTATAAAGAA GTGGATAATA TTGTATCCGT TGTTCAACAA 96 60 

CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTAGTCATG ATATAGGCAA TCAAACTAAT 9 72 0 

ATATTTTCTT CAACCGAACC AAACAAAATG GTAGAGGCTA TT AG AC AAT T TATAGGAGAA 97 80 

TAAGTTTATG GCAGACGAAC TAATTAGTAT TGTAGTTCCA ATCTACAACG TTGAGAATTA 9840 

TTTGCGAATG TGTTTGGATA GCATTCAGAA TCAGACGTAT CAAAATTTTG AGTGT T T ATT 9 9 00 

AATCAATGAT GGCTCTCCAG AT C ATT CATC CAAAATATGT GAAGAATTTG TAGAGAAAGA 9 9 60 

TTCTCGTTTC AAATATTTTG AGAAAGCAAA CGGCGGTCTT T CAT C AG CT C GTAACCTAGG 10020 

TATTGAATGT TCGGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC 100 80 

ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA AAAC G C AG AT ATT AG TAT CG 10140 

GGCGTTATAA TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT AC GG AT C C AG 10200 

ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG 10260 

TCAGAAATGG GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG 103 20 

ATTTACCATT TCCTATAGGA AAAATTG C AG AGGATACTTA CTGGACATGG AAGGTACTTC 10380 

TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT 1044 0 

CTGATACTTT AT CG AAT AC A TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG 10500 

AAGAAAAGAT AGCTATTTTA GCAAGTTCAG ACTATGACTT G AC C AATC AT ATTTTGATTT 105 60 

ATAAAAATAG ATTACAAAGA GTGATAGCAA AAT T AG AAG A ACAAAATATG C AG TT C AC AG 10 620 
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AG ATT T AC AG AAGAATGATG GAAAAATTGT CTTTACTTCC GTAGATAGTA ATAAAAAATG 10680 

AG AT AGC GT A ATATGAAACT ACATTTAACA AATTTATACG GCATGGCTGG TGATAGTACG 1074 0 

GTTATCTTAG CTCAAAATGC TGTTCAAAAG ATAGCTAGTC AACTGGGATT TAGAGAGGTT 10800 

GGTATTTATT TTTACAACAT TGCTTCAGAT AGTCCTTCTG AAATGAATAA GCGTCTGGAT 108 60 

GGTATTATGG CCAGTATCTC TATTGGGGAT ATTTTAGTCT TTCAGTCTCC AACCTGGAAT 10 92 0 

GGTTTTGAAT TTGATCGTCT CTTGTTTGAT AAGCTAAAGG ATATGCAGGT GAAAATTATT 109 80 

TGCTTTATCC ATGATGTTGT TCCCCTCATG TTTGATAGTA ACTATTATCT CATGAAAGAT 11040 

TATCTGTATA TGTATAATCT ATCAGATGTT TTGATAGTGC CGTCAGAGAG AATGAAAACA 11100 

CGCCTGATGG AAGAAGGATT GACGACTAAG AAGATTCTTG TTCAAGGGAT GTGGGATCAT 11160 

CCTCATGATT TATCCTTATA CACCCCTGCT TTTAAAAAAG AACTTTTTTT TGCTGGAAGT 112 2 0 

TTAGAGCGTT TTCCAGACTT ACAAAATTGG TCTCAAGATA CGCCTTTGAG AGTATTTTCA 112 80 

AATAAAGGGG AAGCTAGTTC TAGTGCTAGA AGTCTCAGCA TCGAAGGATG GAAAAAAGAT 11340 

GAGGAATTGT TGCTAGAATT ATCAAAGGGT GGATTTGGCC TTGTCTGGGG AACCcATCAA 11400 

AATGAGGGAG AAAGTAACCA AT ACT AT AC C TTGAATATAT CTCATAAGGT GAGTACCTAT 114 60 

CTAACAGCGG GCATTCCAGT CATTGTACCA AGTAGCTTGT CAACTGCTAA AT TT AT AGT A 11520 

GATCAAGGCT TGGGCTTTAT GGCGGATAGT CTGGAAGAGG TTCATGAGAT AGTTGATAAA 115 80 

ATGAATCTAC AAGAATATCA AG AAATGAC G AATCGTATCA AGACCTTTAG CTATTTGTTA 11640 

AAAGAGGGCT ATTTCACTAA AAAGTTATTG GTAGATGCAA TCTATCACTT GGGAATTGAT 117 00 

TAAGGGAATG AAATGAACAA AACAATTGTA CTAGCAGGGG ATCGCAATTA CACCAGGCAG 117 60 

TTAGAAACAA CGATAAAATC T AT TTT AT AC CACAATCGAG ATGTTAAGAT TTATATTTTG 1182 0 

AATCAAGATA TCATGCCAGA TTGGTTTCGC AAACCACGAA AAATAGCTCG CATGTTAGGT 11880 

AGT GAG ATT A TCGATGTTAA ACT AC CTG AA CAAACTGTGT TTCAAGATTG GGAAAAGCAA 1194 0 

GATCACATTA GTAGCATTAC T T ATG C TAG A TATTTTATTG CAGATTATAT CCAAGAAGAT 12000 

AAGGTTTTAT AT T TAG AC AG TGATTTGATT GTAAATACTT CT T T AG AG AA ATTATTTAGT 1206 0 

ATTTGTTTAG AAGAAAAATC ACTCGCAGCA GTTAAAGATA CAGATGGAAT TACATTTAAT 1212 0 

GCAGGTGTTT TATTAATCAA CAATAAAAAA TGGCGTCAAG AGAAATTAAA AGAACGACTA 12180 

ATTGAACAGA GCATTGTTAC AATGAAGGAA GTTGAAGAAG GCCGTTTCGA GCATTTTAAT 12 24 0 

GGTGATCAAA CGATTTTTAA TCAGGTCTTG CAAGATGATT GGTTAGAACT AGGTCGAGCT 12 3 00 

TATAATTTAC AAGTAGGGCA TGATATTGTG GCTTTGTATA ACAATTGGCA GGAACATCTG 123 6 0 
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GCTTTTAATG ATAAACCAGT GGTGATTCAT TTTACGACCT ACAGAAAACC CTGGACTACC 12 420 

TTG AC AG C C A ATCGTTATCG TGATTTATGG TGGGAATTCC ATGATTTGGA GTGGAGTCAG 12 4 80 

ATTTTACAAC ACCATATGGG AGAATTTGAA CTAATATCGC CTCTAGATAA GGAATTTTCT 1254 0 

TGCTTAACCT TAACGAATTC CCAAGATTTA GAAGGAATAG AAGAGCTAGT TACAGCTCTA 12 600 

CCTGAGGTGG TATTTCATAT CGCAGCTTGG ACGGATATGG GAGATAAATT AAAAAAATTA 12 6 60 

GCTGTATATA ATAATGTGAG ATTGCATCCA CAAATTGTTC CACCGGTCTT AG AT AAG CTG 12720 

AAAAAGTCAA CAAATCTATA TTTGGATATC AATCATGGTA GTGCAGATGA GAACTTTTTA 127 80 

AAATCTTTGC AAGAACAAGA AAAAACGCTA CTAGCTTTTC AATCGACTCA GCACGGAGAG 12840 

TTAGGACAAA TCGTTTTCGA AAATGGGAAA GTTTCCTTTA TGATTGATAC GATTAAAGAT 12 9 00 

TTTAAGAAAA ACGGACATCT TACCTGTTTT CGACAACTTC CAAGTTTAAC TTGTTTAACG 12 960 

TTTACGGCTT CTCAGTATAT CGAACAATTG G AT T AC T TG G CTGGACAGTT GCCAAATGTT 13 020 

GTTTTTCAAA TTGCTGCTTG GACAGCTATG GGGCCAAAAT TATATGATTT GTCTAATCGT 13 080 

TATCCTAATA TTCAGCTCTA TCCGGCAATT T C TAG AG AT A AGCTAGACGA GTTGAAGGAG 13140 

AAGATGGATG CTTATTTAGA TATCAACCTA CTGACTTCAA CATCCGATAT CGTTGCAGAA 132 00 

ATGGCTCATC T AT CTAAAC C TATACTAGCC TTTTATAAAT CTCAAAATGG GAATAATGGC 13 2 60 

CAAAGGTTGT ATTCAAGTGA ACATCCTGAA CGAATGTTGG CTGATTTGCA AAAATTGATA 13 3 20 

ACTAAGGATA TGCTAGAAAA ACCGCTTGAT ATAATCCAGG TGAAAGGGAT AGATGAAACC 13 3 80 

TTGGATTATA TTATTGAACA CAACTCTTCT TTAGTTCGTT TTGGAGATGG GGAAATCAAT 1344 0 

ATGCTTGCAG GGCATTCAAT TCCCTACCAG GATTATGATG AAGAGTTGGT TTCAATCATG 13 50 0 

AGGGACATTA TCGGCCAAGA AAGTCGAGAA GATTTAGTAG TGTGCCTTCC TGATGCTTTT 13 5 60 

AC AG AT CGT T TTAGGTTTAC ATCGTGGGCG ATTCCATTTT GGAAAGATCA CATGGATCAT 13 620 

TATATGGATT TTTACAGAGA GTTATGCAGT GATTCATGGT ATGGCTCAAC CTTTGTATCT 13 68 0 

CGCCCTTATA TCGATTTTGA AGACAAGAGT CAAGCTAAAG CTCAATTTGA AAAATTGAAA 13740 

AG C AT TTGGG AAAACCGTGA CTTACTGATA GTCGAAGGTG CGACTTCTCG TTCAGGTGTC 13 800 

GGAAATGATT TATTCGATGA GGCAAATTCT ATTAAGCGAA TTATCTGTCC TTCTCATAGT 13 860 

GCCTTTTCTA GAGTTCATGA ACTTGAACAA G AAAT T G AAA AGTATGCTGG TGGTCGCTTG 13 92 0 

ATTTTATGTA TGCTTGGACC TACAGCAAAA GTTCTGAGTT ATAATCTATG CCAGATGGGC 13 9 80 

TATCAAGTTT TGGATGTAGG CCATATTGAC TCAGAGTATG AATGGATGAA AATGGGAGCT 14 04 0 

AAAACTAAGG TTAAATTTTC T C AT AAAC AT ACTGCAGAAC ATAATTTCGA CC AAG AT ATT 1410 0 

GAATTTATTG ATGATGAAAC CTATAACAGT CAGATTGTTG CACGAATATT AAACTAGACT 14160 
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AATTAGCGGC CGAGCAGAAT CAAGTTGTTC ATCTTGGGGA TCAGAGTATT TTAAATATTT 15960 

ATTTTGAGGA TAATTGGCTA G C CT TAG AT A AAACATATAA TTATATGGTG GGTATTGATA 1602 0 

TTTATCACCT TGCTCAAGAA TGTGAACGTC TAGATGACAA TCCACCTACA ATTGTTCACT 16080 

ATGCTAGTCA TGATAAACCT TGGAATACAT ATAGTATATC TAGACTACGT GAATTATGGT 16140 

GGGTTTATAG AG AT TT GG AT TGGTCAGAGA TTGCTTTTCA ACGTTCCGAT TTAAATTATT 162 00 

TTGAAAGAAG CAATCAGTCT AAAAAACAAG TGATGCTTGT GACATGGAGT G C AG AT AT AA 162 60 

AACATTTAGA GTATTTAGTA CAACGGTTAC CTGATTGGCA TTTTCATTTG GCTGCACCGT 16320 

GTGATTGTTC TGAGGAGCTG ACCTCTCTAT CACAGTATAC GAATGTAACA GTATATCAAA 163 80 

ATGTATTACA TAGTAGAATT GATTGGCTAT T GG AC GAT T C TATAGTTTAT T TAG AT AT T A 16440 

ATACAGGTGG AGAGGTTTTT AATGTAGTTA CAAGGGCACA AGAAAGTGGC AAGAAAATCT 16500 

TCGCTTTTGA TAT C AC AC GT AAAAGTATGG ATGATGGACT CTATGACGGT ATTTTTTCTG 165 60 

TGG AG AG AC C AGATGATTTA GT GG AT AG AA TGAAGAATAT AGAGATAGAG TAATGAGTGA 16620 

ATTAATTAGT GTTGTGGTAC CGATATACAA TACGGGAAAA TAT TT AG TGG AGTGTGTCGA 16680 

GCATATTCTG AAGCAAACCT AT C AAAAT AT AGAAATTATT TTAGTTGATG ACGGTTCTAC 16740 

GGATAATTCT GGGGAAATTT GTGATGCTTT TATGATGCAA GATAATCGTG TGCGAGTATT 168 00 

GCATCAAGAA AATAAGGGGG GGGCAGCACA AGCTAAAAAT ATGGGGATTA GTGTAGCTAA 16860 

GGGAGAGTAC ATCACGATTG TTGATTCAGA TG AT AT CGT A AAAGAAAATA TGATTGAAAC 16 920 

T C T TT AT C AG CAAGTCCAAG AAAAGGATGC AGATGTTGTT ATAGGGAATT ACTATAATTA 16980 

TGACGAAAGT GACGGGAATT TTTATTTTTA TGTAACAGGG CAAGATTTTT GCGTCGAAGA 17 040 

ATTAGCTATA CAAGAAATTA TGAACCGTCA AGCAGGAGAT TGGAAATTCA ATAGCTCGGC 17100 

CTTTATATTG C CG AC AT T T A AGTTGATTAA AAAAGAATTA TTCAATGAAG TTCACTTTTC 17160 

AAATGGTCGC CGCTTTGATG ATGAAGCAAC TATGCATCGC TTTTATCTTT TAGCCTCTAA 172 2 0 

AATCGTCTTT ATAAACGATA ATCTCTATCT GTATAGAAGA CGTTCAGGAA G CAT C ATG AG 172 80 

AACGGAATTT GATCTTTCCT GGGCAAGAGA TATTGTTGAA GTGTTTTCTA AG AAAAT AT C 17 340 

GGATTGTGTC TTGGCTGGTT TGGATGTCTC CGTTCTGCGT ATTCGATTTG TCAATCTTTT 17400 

AAAAGATTAT AAGCAAACTT TAGAATACCA TCAATTAACA GATACTGAGG AATATAAAGA 174 6 0 

TATTTGTTTC AGATTAAAGT TGTTTTTTGA TGCAGAACAA AGAAATGGTA AAAGTTGAAA 17 520 

TAAAAGAATT GTTATTTACC ATATCACAAA CAATGAAGGT GAGGGGAGTG TTTTATGACT 17 580 

AAGATTTATT CGTCAATAGC AGTAAAAAAA GGACTATTTA CCTCATTTCT ACTGTTTATC 17 640 

TATGTATTGG GAAGTCGTAT TATTCTCCCT TTTGTTGACC TAAATACTAA AGATTTTTTA 177 00 
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GGAGGTTCAA CAGCCTATCT AGCCTTCTCA GCCGCCCTAA CAGGTGGGAA TCTAAGAAGT 177 60 

TTATCAATTT TTTCTGTTGG ATTATCCCCT TGGATGTCCG CCATGATTTT ATGGCAGATG 17 82 0 

TTTTCTTTTT CTAAACGGTT GGGTTTAACA TCTACGTCTA TAGAAATACA AG AT CG C C GT 17 880 

AAAATGT AC C TGACCTTGCT AATTGCTGTG ATTCAATCCT TGGCAGTTAG CTTGAGACTG 17 94 0 

CCAGTACAAT CCTCCTATTC TGCAATATTG GTTGTTCTAA TGAATACAAT ATTGCTGATA 18000 

GCAGGAACAT TTTTTCTTGT TTGGTTGTCA GATTTAAATG CGAGTATGGG GATTGGAGGT 18060 

TCTATTGTAA TCCTCCTATC CAGTATGGTT TTAAATATTC CTCAGGATGT TTTGGAAACA 1812 0 

T TT C AG AC AG TACACATTCC AACAGGGATT ATTGTGTTAC TTGCTTTATT AACCCTTGTC 18180 

TTTTCTTATT TACTTGCCCT TATGTATCGA GCTCGCTATT TGGTTCCTGT TAATAAAATT 18240 

GGCTTACACA ATCGATTTAA ACGCTATTCT TATCTCGAAA TCATGTTGAA TCCTGCAGGT 183 00 

GGGATGCCTT ATATGTATGT GATGAGTTTT CTTAGTGTAC CAGCTTATTT GTTCATCTTG 18360 

TTGGGATTTA TTTTCCCTAA TCATTCAGGG TTAGCGGCTT T AT C AAAGG A ATTTATGGTT 1842 0 

GGAAAGCCTT TGTGGGTCTA TGTTTATATT TCGGTCTTAT TTTTATTTAG TATCATTTTT 184 80 

GCTTTTGTTA CGATGAATGG AGAAGAGATT GCAGACCGTA TGAAAAAATC TGGAGAATAC 1854 0 

ATTTATGGTA TTTATCCAGG TGCGGATACT AGTCGATTTA TTAATCGATT GGTCCTTCGT 18 600 

TTCTCAGTCA TAGGTGGTCT CTTTAATGTG ATTATGGCAG GTGGTCCCAT GCTTTTTGTT 18660 

TTGTTTGATG AAAAGTTATT ACG AT TGGC A ATGATTCCAG GCTTATTTAT GATGTTCGGG 1872 0 

GGCATGATTT T T AC G ATT AG AGACGAGGTC AAGGCTTTAA GGCTAAATGA G AC CT AT AG A 18780 

CCTTTGATTT AGGAGACTTT TATGTATTAT TTTATTCCAG CTTGGTATGG GT C AG AAAG A 18 840 

ACATGGCATG CAGATATCAC TCCATGGTAT TTTTCTCATT TTCGTCTAGA GTTTGATGAT 18900 

ACCTTTCACC AGATTCGGCT CTTTCAAGAG CAAGATATAG ATTCTCGTCT ATTAGTATTA 18960 

GCTTACCAGC CTCATCTACG TTATTTTTTA TATAGACATG GTGTGTTAGA AATGGATACT 19020 

TATTCCGTTT TTGATGTTAT GCAAGATTTT CATAATCTCC ATACCCAAGT TTTAAGCATT 19080 

AGAGATATTG AGTGGGATGA TGACTGTGAA TTTATTTATA GTCCCTTTAC GAT T AT CGTT 1914 0 

CAAAAAAATG GGAAGAAATT TGCTAAGGTT GAACATGGAG TTGAAGGCTT CATCAGTGAT 192 00 

ATACAGTATT TTG AAC C AAA TGGTCAAATA CATATGCACC ATATCGTGGA TGATCGTGGG 192 60 

TTTGTATCGA GCATTATCTT TTTTGAAGAT GGGCAAGCAG CCTATCAAGA ATATCTGAAC 19 3 20 

CTCAAGGGAG AGTGGCAATT CAGAGAGCGT TTAAAAGAAG GAGGACAGGT AGAAGTCAAT 19380 

CCAATTTTGG GTTATCGCTT TAAAATGCTT ACCTATCAAA ATATGGGAGA TCTGGTGGCA 19440 
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TTTTGCAGGT TATCGACCTG CAGAAGGATT CGAGGGTTAC TTTATGATGA AAACTCTTGG 213 00 

ATGTCCCTTC ATTTTATTTT CTGATCCACG TTTAGAGGGG GGAGCTTTTT ATCTCGGAAC 213 60 

GGATGAGCTA GAGGGAAAAG TAAAGGATAC GATCACTCAC TATCTTGATT ATTTAGGCTT 21420 

TGATCATAAG GATTTGATTT TATCAGGTCT TTCTATGGGA ACGTTTCCGG CTCTCTATTA 214 8 0 

TGGTGCTTCT TTTGAACCCC ATGCCATCAT AGTTGGTAAG CCCTTGGCTA ATTTAGGAAC 21540 

TATAGCTAGT CGTGGACGTT TGGACGCACC GGGTGTCTCT AACTTAGCTT TTGATTGTTT 21600 

AAT T CAT CAT ACAGGTGGGA CAAGTTCTCA AGATATGACG GAGTTGGATC AGCGTTTTTG 21660 

GAAAATTTTT AAACAAGCAA ATTTTTCAAA GACAACCTTT GGTTTATCCT ATATGAAAGA 21720 

TGAAGAAATG GATCCACAAG CCTATGAACA ATTAGTGTCT TATCTGTGTA AT AC AGGT G C 217 80 

GAAGATTTTA TCTAAAGGAA CTGCTGGACG ACACAATGAT GATACAGATA C C AAT ATT T C 21840 

TTGGTTTTTG CACTTTTATA GAATGGTCTT AGAGACTGGT TTTGGAAGGG AGAAAAGATG 21900 

ATTATTACAC AGAGACAGTC TATTCATTGG GGAGAAGTTG GTGGGACTTA TATGTATGGA 219 60 

ACAACTGTAT CTTATTACCC TGACAAAAGT GTTCGTCTGT ATAATCCTCT ATTGCCATCT 2 2 020 

GGTGAGATTC TAAAGACTTG GTTTTCTAGT GTCAATTACC AGGCTGCACG AACCCAACCT 22 0 80 

CAGCTTCCCT TATTAAAAAG AAAGCAGGAG TATCAACTAT CACTGGTTTT TGACTGTCAG 2 214 0 

CCTGAAAATG GAGTTTATAC CAAGATAACT TTTTTTGACC GCTATGGTGA TATTTTAGAA 22 2 00 

AAAAAGGTAG AAAAAGTGAA AGATTTCATA TTTACTTATC CAGAAGATAG TTATACTTAT 222 6 0 

CGAGTTTCTC TTTTAAGTGC TGGATTTGAG TCCTTAACTT TTTATCATTT TTCTATCAAG 2 2 320 

G AG AT C AG AA GTGTTTAGAC GTTTAGGTCA AGATTTCCAG CTTAGGAAAG TGAAAAAGAT 2 2380 

TTTAAAGCAG ATTAATGCCC TGAAAGGCAA GATGTCCTCT CTTTCGGATC AAGAATTAGT 22 44 0 

AG C T AAAAC A GTAGAGTTTC GTCAGCGTCT TTCCGAGGGA GAAAGTCTAG ACGATATTTT 2 2 500 

GGTTGAAGCT TTTGCTGTGG TGCGTGAAGC AGATAAGCGG ATTTTAGGGA TGTTTCCTTA 22 5 60 

TGATGTTCAA GTCATGGGAG CTATTGTCAT GCACTATGGA AATGTTGCTG AGATGAATAC 22 62 0 

GGGGGAAGGT AAGACCTTGA CAGCTACCAT GCCTGTCTAT TTGAACGCTT TTTCAGGAGA 22 6 80 

AGGAGTGATG GTTGTG AC T C CTAATGAGTA TTTATCAAAG CGTGATGCCG AGGAAATGGG 2 2740 

TCAAGTTTAT CGTTTTCTAG GATTGACCAT TGGTGTACCA TTTACGGAAG ATCCAAAGAA 22 8 00 

GGAGATGAAA GCTGAAGAAA AGAAGCTTAT CTATGCTTCG GAT AT C AT CT ACACAACCAA 22 8 60 

TAGTAATTTA GGTTTTGATT ATCTAAATGA TAACCTAGCC TCGAATGAAG AAGGTAAGTT 22 92 0 

TTTACGACCG TTTAACTATG TGATTATTGA TGAAATTGAT GATATCTTGC TTGATAGTGC 22 9 80 
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TCTACAACAG 


CTATCCATGG 


CTATCGGTGG 


24540 


TCAATCTGCT 


AGTCAGAAAA 


ATCCAATCGT 


AGAGTACTAT 


CAAGAAGCCT 


ACGCGGGCTT 


24600 


TGAAGCTATG 


AAAGAACAGA 


TTCATGCGGA 


TATGGTGCGT 


AATCTCCTGA 


TGGGGCTGGT 


24660 


TGAGGTCACT 


CCAAAAGGTG 


AAATCGTGAC 


TCATTTTCCA 


TAAAAGGAGA 


AAATATGACA 


24720 


ATTTACAATA 


TAAATTTAGG 


AATTGGTTGG 


GCTAGTAGCG 


GTGTTGAATA 


CGCTCAAGCC 


24780 
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TATCGTGCTG GTGTTTTTCG GAAATTAAAT CTGTCCTCTA AGTTTATCTT TACAGATATG 24 840 

ATTTTAGCCG AT AAT ATT C A GCACTTAACA GCCAATATTG GTTTTGATGA TAATCAGGTT 2 4 900 

ATCTGGCTTT AT AATC AT TT C AC AGATAT C AAAATTGCAC CTACTAGCGT GACAGTGGAT 249 60 

GATGTCTTGG CTTACTTTGG TGGTGAAGAA AGTCACAGAG AAAAAAATGG CAAGGTTTTA 2 5020 

CGTGTATTCT TTTTTGACCA AG AT AAGT T T GTAACCTGTT ATTTGGTTGA TGAGAACAAG 2 5080 

GACTTGGTTC AACATGCCGA GTATGTTTTT AAGGGAAACC TGATTCGGAA GGATTACTTT 2 5140 

TCTTATACGC GTTATTGTAG CGAGTATTTT GCTCCCAAGG ACAATGTTGC AGTCTTATAC 2 52 00 

CAACGAACTT TTTATAATGA AGACGGGACT CCAGTCTATG ATATCTTGAT GAATCAAGGG 2 5260 

AAGGAAGAAG TTTATCATTT CAAGGATAAG ATTTTCTATG GAAAGCAAGC TTTTGTGCGT 2 5320 

GCCTTTATGA AATCTTTGAA TTTGAATAAG TCTGATTTGG TCATTCTCGA TAGGGAGACA 2 53 80 

GGTATTGGAC AGGTTGTGTT TGAGGAAGCA CAGACAGCAC ATCTAGCGGT AGTTGTTCAT 2 5440 

GCGGAGCATT ATAGTGAAAA TGCTACAAAT GAGGACTATA TCCTTTGGAA TAACTATTAT 25500 

GACTATCAGT TTACCAATGC AGATAAGGTT GACTTCTTTA TCGTGTCTAC TGATAGACAA 2 5560 

AATGAAGTTC TACAAGAGCA ATTTGCCAAA TATACTCAGC ATC AG CC AAA GATTGTTACC 25 620 

ATTCCTGTAG GCAGTATTGA TTCCTTGACA GATTCAAGTC AAGGGCGCAA ACCATTTTCA 2 568 0 

TTGATTACGG CTTCACGTCT TGCCAAAGAA AAGCACATTG ATTGGCTTGT GAAAGCTGTG 2 5740 

ATTGAAGCTC ATAAGGAGTT AC CGG AACT A ACCTTTGATA TCTATGGTAG TGGTGGAGAA 2 5 800 

GATTCTCTGC TTAGAGAAAT TATTGCAAAT CATCAGGCAG AGGACTATAT CCAACTCAAG 25860 

GGGCATGCGG AACTTTCGCA GATTTATAGC CAGTATGAGG TCTACTTAAC GGCTTCTACC 2 5920 

AGCGAAGGAT T TGGTCTG AC CTTGATGGAA GCTATTGGTT CAGGTCTACC TCTAATTGGT 25980 

TTTGATGTGC CTTATGGTAA TCAGACCTTT ATAGAGGATG GGCAAAATGG TTATTTGATT 2 6040 

CCAAGTTCAT CTGACCATGT AGAAGACCAA ATCAAGCAAG CTTATGCCGC TAAGATTTGT 2 610 0 

CAATTGTATC AAGAAAATCG TTTGGAAGCT ATGCGTGCCT ATTCTTACCA AATTGCAGAA 2 6160 

GGCTTCTTGA CCAAAGAAAT TTTAGAAAAG TGGAAGAAAA CAGTAGAGGA GGTGCTCCAT 2 6220 

GATTGAACTT TATGATAGTT ACAGTCAAGA AAGT CG AG AT T T AC ATG AAA GTCTAGGCGC 2 6280 

TACTGGTCTT TCTCAACTTG GAGTGGTCAT CGATGCAGAT GGTTTTCTGC CTGATGGTCT 26340 

GCTTTCTCCT TTTACCTATT ATCTAGGTTA CGAGG AT GG A AAACCTCTCT ATTTTAATCA 2 64 00 

AGTTCCCGTT TCAGATTTTT GGGAAATTTT AGGAGATAAT CAGTCTGCTT GTATTGAAGA 2 6460 

TGTG AC GC AG GAGAGGGCTG TCATTCATTA TGCTGATGGA ATGCAGGCTC GCTTGGTTAA 2 6 520 



WO 98/18931 



PCT/US97/19588 



590 

ACAGGTAGAC TGGAAAGACC TAGAAGGTCG AGTACGTCAG GTTGACCACT ACAATCGCTT 2 6 580 

CGGAGCTTGT TTTGCTACAA CGACTTATAG CGCAGATAGC GAGCCGATTA TGACAGTTTA 2 6 640 

CCAAGATGTC AATGGTCAAC AAGTTTTACT GGAAAACCAT GTGACGGGTG ATATCTTATT 2 67 00 

GACTTTGCCA GGTCAGTCCA TGCGTTACTT TGCAAATAAA GTTGAATTTA TCACCTTCTT 2 67 60 

TTTGCAAGAT TTGGAAATAG ATACCAGTCA GCTTATCTTT AATACTCTAG CGACTCCTTT 2 6 820 

CTTGGTTTCC TTCCATCATC CAGATAAATC TGGCTCGGAT GTCTTGGTAT GGCAGGAACC 2 6880 

TCTCTATGAT GCCATTCCAG GTAATATGCA GTTGATTTTG GAAAGTGATA ATGTGCGTAC 2 69 40 

TAAGAAGATC ATCATTCCAA ATAAGGCGAC TTATGAGCGC GCTTTAGAGT TAACTGACGA 2 7 000 

GAAATACCAT GATCAGTTTG TGCACTTGGG T TAT CAT T AC CAGTTCAAAC GTGATAATTT 2 7 060 

CCTAAGACGA GATGCCTTAA TCTTGACCAA TT C AG AT C AG ATTGAGCAAG TAGAAGCAAT 27120 

CGCAGGAGCC TTGCCTGATG TCACTTTCCG TATTGCAGCG GTGACAGAGA TGTCTTCTAA 2 7180 

GCTCTTAGAC ATGCTTTGCT ATCCTAATGT GGCCCTTTAC CAGAACGCTA GTCCACAGAA 2 7240 

GATTCAGGAG CTGTATCAAC TGTCGGATAT TTACTTGGAT ATAAACCACA GTAATGAGTT 273 00 

GCTACAGGCA GTGCGTCAGG CCTTTGAGCA CAATCTCTTG ATTCTTGGCT TT AAT C AG AC 273 60 

GGTGCACAAT AGACTTTATA TCGCTCCAGA C CAT C T ATT T GAAAGTAGTG AAGTTGCTGC 2 742 0 

TTTGGTTGAG ACCATTAAAT TGGCCCTTTC AGATGTTGAT CAAATGCGTC AGGCACTTGG 27480 

CAAACAAGGC CAACATGCAA ATTATGTTGA CTTGGTGAGA TATCAGGAAA CCATGCAAAC 2 7 540 

TGTTTTAGGA GGCTAACATG TCAGAGGAAG ATTTATTTTA CAAAGACGTT GAAGGCCGCA 27 600 

TGGAAGAGTT GAAACAAAAA C C CAT C AAG A AGGAAAAAGA AACCCGAGGG G AAAAG AT T A 276 60 

GTAAGACTTT TTCACTTTTA CTGGGTTTGA TGATTCTGAT TGGTTTGCTC TTTACTTTGC 2 772 0 

TGGGAATTTT GAGGTAGATC TATGATTGAA AT ACT AAT TG TTTTAGCTAT TATCCTATCT 2 7780 

CTTGCTTTGA TTGTATTGGT AACTATACAA CCCCGTCAAA ATCAACTATT TTCCATGGAT 2 7 840 

G C C ACT AGT A AT AT TGGT AA AC C AAGCT AC TGGCAGAGCA ACACCTTGGT CAAGGTGCTC 27900 

ACTTTATTGG TGAGTTTGGC TTTATTTATT CT AC T ATT AA CCTTTATGGT GATTACTTAT 279 60 

AAATAAAAGA AAACTTCAGA TATTCACCTT TTGTGGATTG GTCTGAAGTT TTCTTTTTTA 2 802 0 

TACTCAATGA AAATCAAAGA GCAAACTAGG AAGCTAGCCG CAGGCTgCTC AAAACACCGT 2 8080 

TTTGAGGTTG TAGATATAAC TGACGAAGTC AGCTCAAAAC ACCGTTTTGA GGTTGTAGAT 2 8140 

ATAACTGACG AAGTCAGCTC AAAACACCGT TTTGAGGTTG TGGATAGAAC TGACGAAGTC 2 82 00 

AGCTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAGTCAGCTC AAAACACCGT 282 60 

TTTGAGGTTG TGGATAGAAC TGACGAAGTC AGCTCAAAAC ACCGTTTTGA GGTTGTGGAT 2 8320 
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AGAACTGACG AAGctCAGTA ACATATATAC AGCAAGGCGA CGCTGACGTG GTTTGAAGAG 2 8380 

TATTACTGTC TATATTTTTG GTAAAAATCA ACTTTTACTT GGATGAAGGT TTTGGCTTCA 28440 

CGTAGGAGTT GAAGAAGGGT GGCGCGGGTT TCAAATTCTT CTCTTGTCTT GGGCAGACTG 2 8500 

CGGTTCCGGA AGACTTCCAG ATAACGTTCA ATTTCATCTA GCAAATCAGA AGCAGGATTG 2 85 60 

GTCTGGCTCA GTTGACCTGC AATTTTTGAA AAGAGTTGCG CTAAGATCAG GCTTTCACTG 2 8 620 

GCGGCAAGGT GACAAGTGTT AATCTGTTGG GCCATGTTTC T C AGG AT AC G ACTTTGTCGC 2 8680 

TGTCTCATCT CAAAGTAGTG GATATGGTAG TCTGTCTGGT GAAAGAGGTG GTCAGAGTGA 2 8740 

TCCAAATAGA CCAGTCTGAG GGCTTCTTTC AAAAGCGTGT CTAATTCTGC TACCAGCTGT 2 8800 

GCTCGGTTGC GTCCGTCTCC TCTGGATAAA TAGTATTTGA AGCGCTGGAG GATATCTTTT 2 88 60 

AACTTTTCTT CCACCAGCGT GTGGTAGTGC TGGATTTCCT CTTCTCGTGA AGGCATATAG 28920 

AGATTAACAA GCAAGGCAAA TCCTGTACCA ATAGCAAAGA GAAGGAATTC ATTGACTAGA 2 8980 

AGGTCTGGAG AGGTTGACTC TTGAACCAAG AGATGGCTAA CCAAAACAGT GCTTGGTGTG 2 9040 

ATGCCAATTT CCCAGCCCAT CTTGTAGGCT AAAGGAACGT AGAAGGCCAG AT AG AGG C CG 2 9100 

AGACTCCAGA TATGAAATCC GCTCAAGTGA AAAGCTAGAA CACCGATAGC CAGAGCTAGA 29160 

AG CAT AG AAA AAAGACGATT GCGAGCCAGT TTTAAAGTAC TTCTACGCGT ATCAGATAGG 2 922 0 

CTCAAGAGAG CGATAATTCC AGCCGAAACT GCTGACGAAA GAT T GAG AAA ATAAGCAAGC 2 9280 

AGGCAGGCAA G AC AGGT AG C TAAGATGAGC TTGGTCGTAC GTTGGCTAAT AGACATAAGA 2 9340 

ATTTCCTAAT AAGTTAGAAT AAAAGCGTAA AAGACAAGAC ATGAGCAGGC TTGCCTTGAT 2 9400 

GAGTTATTTT TTACGGGTTG CTGCGTATTC GGCAACGGCG GTAAAGAGGA CATCTGTAGA 29460 

AGAGTTAAGG GCTGTTTCAC ATGAGTCTTG GATGACACCA ATCACAAAAC CAACCCCAAC 2 9 520 

AATTTGTATG GCAATATCGT T AG AAAT AC C GAAAAGGCTA CAAGCAACTG GGATAAGAAG 2 9 580 

GAGGGAACCT CCGGCAATAC CTGAAGCATC ACAGGATGAG ATAGCTGCTA CCACACTGAG 2 9640 

GACAAAGGCT GTGGCAAAGT CAACAGGAAT TCCAAGAGTG TTAACTGCAG CAAGGGTCAA 2 97 00 

AAGGTTAATG GTAATCGCTA CTCCAGCCAT ATTGATAGTA GAACCGAGTG GGATAGAAAC 2 9760 

AGAATAGGTA TCTGGGTTGA GTCCAAGGTC ATGGCAGAGT TTCATGTTGA CAGGAATGTT 2 9 820 

AGTCGCAGAA CTACGAGTGA AAAAGGCTGT CACACCGCTG ACACGGAGGC AGTTCCAAAC 2 9880 

TAGAGGGTAA GGATTGCGTC T C AT AAAGAA GAAGGCAATC AAAGGGTTGA CCACAGGGGC 2 9940 

AACAAAAAGC ATAGTCGTTA CTAATAGAAC CAATAAAATA CCGTAGTTGG CAAGGCTTCC 3 0000 

GACTCCCTTG TCAGAAATGG TTTTAAAAAC AAGACCAAGG ATTCCAAATG GAGCCAGATT 3 0060 



WO 98/18931 



PCT/US97/19588 



592 

GATGATCCAT TCGACAATTT TAGAAGTCAC GTCAGCGATA GTTTTTAGCA ATTCTTGACT 3012 0 

ATTTTTACTG GCTTCTCTCA TAGCGATTCC AAAAAT G AC T GCCCAAGATA AGATTCTAAT 3 0180 

ATAGTTAGCA GTAAGCAGGG CGTTGACTGG GTTGTCAACC AGTTTGAGCA AGAGGTTGCT 3 0240 

GAGAACCTGC CCAATCCCAT CTGGTGGTGC AATTTCAGTA TTGGCACTAT TTGGGGTAAT 30300 

TTCAATAGGG AC GAT G AAAT TTGCTAGTAC AGCTACAAGA GCAGCGGCGA AAGTCCCTAT 3 03 60 

CATAGGATAT ACAAGAAAAC AACAGTTTTC ATATTGCTAT CTTGTCCCTT TTGATGTTGG 3 0420 

GAAAGGGCAT TGGCAACGAG AGCAAAGACT AGGATAGGAG CAACAGCTTT TAGACCTCCA 30480 

ACGAATAAAT CCTCGAGTAG CCCAATCCCT GAGAGATTAG GAAGGGTCAG TCCTAGGATT 3 0540 

CCCCACAAAG CATACCAATC AAGATACGCT TGACAAGGCT TGCCTTATTC CAAGCATGAA 30600 

TGATTCTTTT CATAATAATC TCCTTTTTGT GTAGTGATTA TGATTATAGT ATAAATGATA 3 0660 

GACAAAATCA AGAATTTTCT GTCTATTTTT TGAATATTTA TGGAGAATGA GACTGATGAA 30720 

AATATGGTAT AATGAAATAA AGGAGTTTTA TATGCAAAAA TT T ATT C AG G C TT AT ATT G A 3 0780 

AAAG C TAG AT GTGACAACCA TTATCGAGAA TATTCTAACC AAGGTCATTT CTCTTTTACT 30840 

GCTTTTAATT GTATTTTATA TTGCTAAAAA AATGCTTCAT ACCATGGTGC AGAGAATTGT 3 0900 

CAAACCTTCT CTAAAAATGT CTCGTCATGA TGTTGGACGC CAAAAAACCA TCTCACGTTT 3 0960 

AC T AG AAAAT GTGTTTAATT ATACGCTATA TTTCTTTTTA CTCTACTGCA TTTTGTCGAT 3102 0 

TTTAGGTTTG CCAGTTTCTA GTTTGCTGGC TGGAGCTGGT ATTGCTGGGG TAGCGATTGG 31080 

TATGGGAGCC CAAGGCTTTC TGTCTGATGT CATCAATGGC TTTTTCATCC TCTTTGAACG 31140 

TCAACTGGAT GTGGGAGATG AGGTCGTTCT GACAAATGGA CCGATTACTG TATCGGGTAA 312 00 

GGTTGTCAGT GTGGGAATTC GTACGACACA GCTTCGTAGC GAGGAGCAAG CCCTTCACTT 312 60 

TGTCCCTAAC CGAAATATCA CAGTTGTTAG CAATTTCTCA C GC AC AG AC T AGACCTGTTA 31320 

TTTTAAGTAA TTTGTGGTAC AATAGAGGGA GTTTAATAAG GAGAAAAGAT GGTTTTAGAA 31380 

AAGCAGTTGG GCAATGGTTG TACCTGGATA GACCTAGACC TAGGAAAGTT GAATAAACTA 31440 

GAAGACCTTT CTGAAATTTA CGGTTTGGAC AAGGAAACCA TTGAATACGC ACTGGATAGA 31500 

AACGAGCGCG CCCACATGGA CTACCACCGT GAAAGTGAGA CGGTTACCTT TATCTATAAT 31560 

GTCTTAGACG TAAAAAAGGA CAAGGCCTAC TATGAGACTT TTCCCATGAC CTTTATTGTC 31620 

GAGCATCGTC GCCTGATTAC CAT T AGT AAT ACCAAGAACG CCTATGTCAT TGAACAGATG 316 80 

ACTCGTTATC TGGAGAACCA TGACACGCTT TCGATTTATA AGTTTCTCTT TGCCAGTCTG 31740 

GAAATC AT C A GCAATGCCTA CTATCCTGTC ATTGAGCAGA TGGACAAGAG TAGGGATGAG 31800 

GTCAATGACC TCTTGCGCCA GCGAACTACC AAGAAAAACC TCTTTGTCCT GTCTGATTTG 318 60 
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GAGACTGGTA TGGTTTATCT GACGGCAGCT GCCAAACAAA ATCGGATTTT GTTAGAGCAT 31920 

ATTCAAGGTC ATGCCTTGTA TCGTAGTTTT GATGAGATTG AGAGAGAACA GTTTGATGAT 319 80 

GCCATGATTG AGGCTCATCA GCTGGTATCC ATG AC AG AC C TAATCTCTCA GATTTTACAG 32 040 

CAGCTTTCAG CCTCTTACAA CAATATTCTA AACAATAATC TGAATGACAA TTTGACAACC 3 2100 

TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCG TGACAGGCTT TTTCGGAATG 32160 

AATGTTCCCT TACCTTTAAC AGATGAGCCC CATGCTTGGC TCTATATCAG TTTGGCTAGT 3222 0 

GCAGGTTTGT GGATTGTTTT AT CCTTGTT A CTAAGGAAAA TTGCGAAAAA AAGTTAAGAA 322 80 

AAGGAGCCAG AATGGCGATT G AAAAT TATA TACCAGATTT TGCTGTGGAA GCAGTCTATG 32340 

ATCTGACAGT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC TGTTTTGGTC GATTTGGATA 3 2400 

ATACCCTCAT TGCTTGGAAC AACCCTGATG GAACGCCAGA GATGAAGCAA TGGCTACATG 3 2460 

ACCTTCGGGA CGCGGGTATT GGCATTATCG TAGTGTCAAA TAACACCAAA AAACGCGTTC 3 2 520 

AACGAGCAGT TGAGAAATTT GGGATTGATT ACGTTTACTG GGCCTTGAAG CCCTTCACAT 3 2 580 

TTGGTATTGA CCGTGCTATG AAGGAATTCC ACTATGACAA AAAGGAAGTG GTCATGGTTG 32 640 

GTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTGC AGGGATTCGG TCAATTTTAG 32700 

TCAAACCCTT GGTCCAACAT GACTCAATCA AAACGCAGAT TAACCGAACT CGTGAGCGTC 32760 

GTGTTATG 32 7 6 8 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 6 0 

CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 12 0 

AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 18 0 

T T AC CGTAGA AAAGGGTGAC TACGTTGCCA TCATGGGTGA GTCTGGTTCT GGTAAATCAA 240 

CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 3 00 

GAACTGACAC CGCAACTATT AAAAATTCAC AGGCTTCTAG TTTCCGGCGT GAAAAGCTAG 3 60 

GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TGTTAAGGAC AATATCTTGC 42 0 
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TTCCGCTTGT CTTGTCAAGA AG AC C T AT AA CGGAGATGAT GAAGAAATTG GTGGTGACAG 4 80 

CTGAGAATCT GGGTATTAAC CAATTGCAAG AG AAGT AC C C TTACGAGATT TCTGGTGGTC 54 0 

AGAAACAGCG TGTAGCAGTA GCCCGCGCCA TCATCACAGA ACCTGAAATT CTCCTTGCGG 600 

ACGAGCCAAC AGGAGCCCTT GATTCCAAGT CATCTGCAGC C T T ACT TG AT GTCTTTAATG 660 

AAATCAATGA GCGTGGGCAA ACCATCCTCA TGGTAACCCA CTCAACAGCA GCTGCTAGCA 72 0 

GGGCCAAGCG TGTTCTCTTT ATCAAAGACG GCATTCTTTA CAACCAAATC TACCGTGGAG 780 

AGAAGACAGA GCGTCAGATG TTCCAAGAAA TCTCTGATAC CTTGACTGTC ATGGCAAGCG 840 

AGGTGAATTA GTATGTTTCG ATTAACCAAT AAGTTAGCGG TATCGAACTT GATTAAAAAC 900 

CGCAAACTCT ACTATCCCTT TGCACTGGCT GTTCTCTTGG CAGTCACCAT CACCTATCTC 960 

TTTTACTCCC TAACCTTCAA TCCAAAGATT GCGGAAATCC GTGGAGGAAC C AC C ATT C AA 102 0 

GCAACACTTG GATTTGGTAT GTTTGTCGTT ACCCTTGCGT CAcCATTATC GTCCTCTATG 108 0 

CCAATAGTTT TGTCATGAAA AACCGTTCCA AGGAACTGGG T AT AT ATGG C ATGTTAGGCT 1140 

TGGAGAAGCG CCATCTAATC AGTATGACCT TTAAGGAGTT AGTGGTATTT GGGATTCTAA 12 00 

CTGTTGGAGC GGGTATCGGT ATTGGAGCCT TGTTTGACAA GTTAATTTTC GCTTTCCTGC 12 60 

TCAAACTAAT GAAACTGAAG GTTGAGCTGG TTGCTACCTT CCAAATGAAT GTTGTCATTG 132 0 

CAGTACTTGT TGTCTTTGGA TTGATTTTCC TAGGCCTCAT GTTCCTGAAT GCTCTTCGAA 13 80 

TCGCCCGTAT GAATGCCCTC CAGCTCTCGC GTGAGAAAGC AAG CGG AG AG AAAAGAGGTC 1440 

GCTTCCTACC TCTCCAAACG ATTCTTGGTT CCATAAGTTT AGGGATTGGC TATTATCTTG 1500 

CCCTTACGGT AACCGATCCT CTTACAGCCC TAACAACTTT CTTCCTAGCT GTTTTGCTGG 15 6 0 

TTATCTTTGG TACTTATCTA TTGTTTAATG CAGGGATTAC AGTCTTCCTA CAAATCTTAA 162 0 

AGAAAAACAA GAAATACTAT TACCAACCTA ATAACCTCAT ATCTGTTTCC AAC T TG AT T T 1680 

TCCGTATGAA GAAAAATGCG GTT GG AC TAG CAACCATCGC TATTTTGTCA ACAATGGTTT 1740 

TGGTAACCAT GTCAGCAGCG ACAAGCATTT TCAATTCCGC AGAAAGCTTT AAAAAAGTTC 1800 

TAAATCCTCA TGATTTTGGG GTTTCAGGGC AAAATGTTGA AAAAGAAGAT TTGGACAAAC 1860 

TCTTGAGCCA GTTTGCAAGT GACAAAGGTT ATAGTGTCAA AGAGAAAGAA GTACTTCGTT 1920 

ACAGTAACTT TGGTATTGCA AATCAAGAAG GAACCAAGTT AACTATTTTT GAAAAAGGAC 1980 

AAAACCGTGT CCAACCCACA ACAGTTTTCA TGGTATTTGA C C AAAAAG AT TATGAAAATA 2 040 

TGACTGGTCA AAAACTGTCT CTATCAGGAA ATGAGGTCGG TCTCTTTGCC AAAAATGACG 2100 

GACTGAAAGG ACAGAAAGCT CTAACTCTAA ATGATCATCA ATTTTCTGTC AAAGAAGAAT 216 0 

TTAATAAAGA TTTCATTGTG AACCATGTTC CAAATAAGTT TAATATCTTG ACTACTGATT 222 0 
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ACAATTACCT TGTTGTTCCT GATTTACAAG CCTTTTTGGA TCAATTCCCA GATTCGGCTA 2280 

TCTATAATCA GTTTTACGGT GGTATGAATG TAAATGTCAG TGAAGAAGAA CAACTCAAGG 2 340 

TCGCTGAGGA GTATGAAAAC TACCTCAATC AATTTAATGC TCAATTAGAC ACAGAAGGTA 2400 

GCTATGTTTA TGGTAGCAAT CTAGCAGATG CTAGTTCTCA GATGAGTGCC CTCTTTGGTG 2 4 60 

GTGTCTTCTT TATCGGTATT TTCCTATCCA TTATCTTTAT GGTCGGAACT GTTCTGGTCA 2 52 0 

TCTACTACAA ACAAATTTCT GAAGGCTACG AAGACCGTGA ACGCTTTATT ATCTTGCAGA 2580 

AAGTCGGTTT GGACCAAAAG CAAATCAAGC AAACCATCAA CAAACAGGTT TTAACTGTTT 2 64 0 

TCTTCCTTCC TTTGCTCTTT GCCTTCATAC ATCTCGCCTT TGCCTACCAT ATGCTTAGCC 2700 

TGATTTTAAA AGTGATTGGT GTACTGGATA CGACTATGAT GTTGATTGTG ACCTTGTCTA 2 7 60 

TCTGCGCTAT CTTCCTCATC GCCTATGTGC TGATTTTCAT GATTACTTCA AGAAGT TAT C 282 0 

GCAAGATTGT GCAAATGTAA AAAAGATACC TCGACTTCAA AATCGAGGTA TTTCTTGTAT 2 880 

TCTAAATGCT GAAAAGTTGT CCGAGCAGGA AGGTAACTCC CATGGTCAAG AGACCAATAG 2 94 0 

CAAGGTTCCG AATCATAGCT GTTTTGGTTG GGGCTTTTCC AAGT C TAG C A CTTGTGTAAC 3 000 

CAGTGAGAAG AAGGGCCACA CCGACAATAA GGACGGTAGC AGGGATGCGG TAATCACTTG 3060 

GAAAAATGGT CACTGACAGC ATTGGAGGCA AACTTCTAAG GAAAAAGGCA ACGAAGCTAG 312 0 

AAATGGC AG C GTGCCAAGGA TTGGTAAATT CTTC AT AC T C AATCCCATAT TTTTCCTCTA 3180 

CCAGAGCCTT GAGTGGATTT TTAAGAAAGA TCTTATTGGT CAAGAGTTGG GCAGAAGTTT 324 0 

TGAATTCTCC ATTTTGGATA TAAGCAGCAT AGAGGGATTT TTTGGCTAGT TCCCTATCTT 33 00 

GGTCTAGCAA GAGTTTTTCT CGCGAAACGG CAGCTTCCTC GGTATCTTTT GGAGTTGAAA 33 60 

CGGATACATA TTCTCCACCA GCCATTGAAA AGGCACCAGC T AAG AT AGC C GTAAAACCTG 3420 

ATAAAAAGAT AAT C C AG AT A TTGGTCGTGG CACTGGCAAC TCCGATAACC ACACCAGCAA 3480 

TGGAAATAAT TCCATCGTTA GCATCAAGAA CACCCGCACG CAGGATATTT AAACGACCTG 3 540 

CAAAATTTGA ATCAATTTCG TGATTTGTTT CTGACGCTAA ATTTCAAGTT CAAGTTAGCC 3 600 

AT C AAGAAGT CTTCTCTGGG TGACTTGTAG TCCAAGCATT TTTTAGGATA GTTGTTAATC 3660 

CACTTTTCGA TGAATGCGAC TTCTTTGGGA GTCATTTTCT TGGTTCCCTT AGGTAACCAT 3 72 0 

CTACGAATGA GCCTGTTGTG ATTCTCATTA GTTCCCCTTT CCCAAGAGGC ATAGGGATGT 37 80 

GCATAATAAA TGTGCTCCTC AGAAAATACA TTAGACAAGC GATTGAATTC CGTTCCATTA 3 840 

TCTGCCGTGA T GGAAAG AAT CTTGTGTTGT TTTAAGATGA GTTTTAGAGC CTGATTGACC 3900 

ACATCAGCAC TTTTATTTGG AATCAATCGG ATGATCTGAT GTCTACTTTT TCGATCCGTC 3 9 60 
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AAGACAAGCA 


AGCAGTAGTT 


TTTCGCTCTC 


GTAAGTAGAA 


CTGTATCAAT 


CTCATAATGC 


4020 


CCATTCTCCA 


AGCGAAAATT 


GAT AG C TT C A 


AGCCGCTGTT 


CGATGGATTG 


ACCAGCAGGT 


4080 


TTAAAGTTGG 


TGCTGGCCTG 


TTTCTTAAGC 


GCTTTTCCTT 


TTCTAGGGTA 


AAGCAGATCC 


4140 


TGTTTGCTTA 


ACCCCAATTT 


TCCATGATGA 


ATCCAATAGT 


AAATGGTTGA 


AATTCCCACG 


4200 


TTAACCCCTT 


TAGCCATCAC 


CAT C ATT TC A 


GGCGAAAATT 


TTTGGTTATG 


ATAGTGGAGA 


4260 


ATCTTTTCCT 


TTAGTTCCTT 


GGTCAAGCTT 


GATTTCTTGA 


CCGAGCGCTT 


GCGATTGTTT 


4320 


TCATAAGACT 


GTTGAGCATA 


GTCGGCAGAA 


TAAACCTCTT 


TGAAGCGCCC 


TTTTCCAAGA 


4380 


CATTGTCGGA 


CTGTCCCACG 


CTTGATTTCA 


GTGTGGATAG 


TTTGAGGAAC 


TTTTCCAAGC 


4440 


AGAGAGGCAA 


TTTCTCTATT 


TGATTTCCCT 


TCTTTTTTCC 


ATCTTTCGAT 


TAAGCGACGG 


4500 


CTATCGATTG 


TCAAATGTTC 


GCCTTTTGTA 


GTATAATGGT 


TTTGCATCTC 


TGTGCCTTTC 


4560 


TTGTGTTTGT 


GGTTGAACAA 


CAAGTATAAC 


ACAGAGGTGT 


TTTCTTATGC 


CTACAAGAGC 


4620 


TATCGGCTAG 


TTGAACCATC 


TAATTTTTAG 


GAGGGCTGGG 


TGGCTAACTT 


CATTATAGAA 


4680 


CTTTCATTTA 


CGAACATATA 


GTAAAATGAA 


ACAAGAACAG 


AACAAATCGA 


TCAGGACAGT 


4740 


AAAATC T AT T 


TCTAACAATG 


TTTTAGAAGC 


AGAGGTGTAC 


TATTCTAGTT 


TCAATCTATT 


4800 


ATATTTTTGT 


TTTTTATCAA 


AAAATACTTT 


ACAAGTTCTT 


AAAAACATGA 


TATAGTAATA 


4860 


AAGCTTAGAA 


AATGAGATGA 


TGTTTTCTAG 


CAAATATAAA 


CCCGAGTAAA 


AAATGCCTAC 


4920 


GGACAGGCAG 


GGTTGAATGC 


CGAAGCGTGG 


TTGAAAAGCC 


AC AT T ATTG A 


TAGGGTTAAA 


4980 


AGCCTACTTT 


TATAAGTTGA 


TGTTAGGACA 


CTTGTCCTAA 


TTCATAAATT 


TTTAGTGTGG 


5040 


TGAAAGCACA 


CGTCATCTTG 


TG AAACG AT C 


AATAAAGTAC 


GTAATATTTG 


C T AC TAG AG A 


5100 


GTTAGGAAAC 


AT CGGG AAC A 


G AC AT AC TC A 


ACAGAAACCA 


AAATAAACAC 


GTCAGAAGAT 


5160 


TGCAGAGCAG 


GTGAAAACCT 


GCTCTTTTTT 


CATGAGTCAA 


CCTTTAGTTC 


CTTAGTTTTC 


5220 


ATAAGGTCCT 


AAAAAT AT TG 


AAAGGAGTAT 


GTTTTGAAAG 


AGTTAGATCA 


AAACCAAGCC 


5280 


CCAATTTATG 


AGGCCTTGGT 


GAAGTTACGC 


AAGAAAAGGA 


TTGTTCCCTT 


TGATGTTCCA 


5340 


GGTCACAAGC 


GTGGACGGGG 


AAATCCAGAA 


CTTGTCGAAC 


TCTTAGGAGA 


AAAATGTGTA 


5400 


GGCATTGATG 


TCAATTCGAT 


GAAACCTTTG 


GATAATTTAG 


GCCATCCTAT 


T T C GAT T ATT 


5460 


CGTGATGCAG 


AGGAGCTGGC 


TGCAGATGCT 


TTTGGAGCTA 


GCCATGCCTT 


TCTAATGATT 


5520 


GGTGGAACAA 


CTTCATCGGT 


GCAGACTATG 


ATTCTGGCAA 


CCTGCAAGGC 


AGGAGATAAG 


5580 


ATTATTCTGC 


CACGAAATGT 


C C AT AAATCT 


GCTATCAATG 


CGTTGGTTCT 


ATGTGGTGCC 


5640 


ATTCCCATCT 


ATATCGAGAT 


GAGTGTAGAT 


CCTAAGATTG 


GTATCGCTTT 


AGGTCTTGAA 


5700 


AATGACCGAG 


TAGCACAGGC 


CATAAAGGAC 


CATCCAGATG 


CTAAGGCTAT 


CCTAATCAAC 


5760 



WO 98/18931 



PCT/US97/19588 



597 

AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 5820 

GCTGGCATGA TGGTTTTAGT AGATGAAGCC CACGGAGCGC ATTTGCATTT C AC T GAT AAA 5 880 

CTTCCAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAG CAGTTTCCAT GCATAAGTCT 5940 

GGTGGGAGTT TGACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA TAATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGCTAGT 6060 

TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG AGTCGTTTGA GAAAGTCATT 612 0 

GAGCTATCTG AGTATGCCCG CCGTGAAATC AATGCTATCG GTGGCTACTA TGCCTACTCA 6180 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 6240 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6 3 00 

AT T C AG AT CG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCGACCGC 63 60 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 642 0 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 64 8 0 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTGTTGG ACAGGTCTGT 6540 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TGGCTCCTGG TGAACGCATT 6600 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAAGGAAC GTGGTTGCTC CCTCCAAGGG 6660 

ACGGAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA C T AT AAG AAA 672 0 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGGC GATGATTTTA 67 80 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6 84 0 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA AC T C AGG AAA G ATG AT AT C A AGACAACGAG 6960 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 7 02 0 

TTTTTAGTTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 714 0 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTGTA AACTTTATAT CCTTGTTCAT 7 2 00 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGGTATT TTCAAGAGCG ATGAGACAGT 72 60 

CTTGGTTGAG CTGTCGAAGA GACAGATCTA AGAGTTCAAA ACCAGCTTTA TT ATT T G AAA 7320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTGTTTAT 7380 

TT T T AG CG AC ATCAATGCCC ACATAAAGCA TGGGAGTATC TCCAGATATA GTATTTCAAG 7440 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7500 
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CGT T ATC AAA. 


CT C ATT AC C A 


ATTGAAACAA 


AAAACTGTGG 


TTAGAGCCTT 


TCGGAAATCG 


7560 


TCAAGCGATT 


GGAGGAAATG 


AACTAATCCA 


CAGTGGCTTA 


TTCCAAGTAT 


ACCACTTGGG 


7620 


CTTTGGCAGT 


AGCTAACTGC 


GCTAAATATA 


ATATAAGGAG 


AAAT AG AT GG 


ATTTATGGTT 


7680 


TTCTGAAGTT 


CATACTCCAG 


ATGTCAAATT 


GTCTCTGAGA 


ACAGCCAAGC 


AACTTTACGC 


7740 


TGGAAAAAGT 


GAATGGCAGG 


AT AT C G AAGT 


CTTGGATACG 


CCAGCTTTTG 


GG AAAAT AC T 


7800 


GATTTTAAAT 


GGCCATGTCT 


TGTTCTCAGA 


TGCGGATGAT 


TTCGTCTACA 


ATGAAATGAC 


7860 


CGTTCACGTT 


CCCATGGCTG 


TCCACCCAAA 


T C C AAAG AAA 


GTATTGGTTA 


TTGGGGGTGG 


7920 


TGACGGCGGT 


GTTGCCCAAG 


TATTAACCCT 


CTATCCTGAA 


CTGGAGCAAA 


TTGATATTGT 


7980 


GGAACCGGAT 


GAGATGTTGG 


TCGAGGTCTG 


TCGTGAGTAT 


TT C C C AG AC T 


TTGCTGCAGG 


8040 


GCTAGATGAT 


CCTCGTGTTA 


C CAT T TAC T A 


CCAAAATGGG 


CTACGCTTTT 


TGCGAAACTG 


8100 


CGAAGATGAT 


TACGATATTA 


TCATCAACGA 


TGCGACAGAT 


CCATTTGGCC 


ATACGGAAGG 


8160 


ACTCTTTACC 


AAGGAATTCT 


ACGGCAATAG 


TTATCGAGCT 


CTGAAGGAAG 


ACGGCATCAT 


8220 


G ATTT AC C AG 


CATGGGAGTC 


CCTTCTTTGA 


CGAGGATGAG 


TCGGCCTGCC 


GAAGCATGCA 


8280 


CCGCAAGGTC 


AATCAAGCCT 


TTCCAATCAG 


TCGGGTTTAT 


CAGGCCCATA 


TTCCAACTAG 


8340 


CCCAGCTGGC 


TATTGGTTGT 


TTGGATTTGC 


ATCGAAAAAA 


TACCACCCTG 


TC AAAG ATTT 


8400 


TGACAAGGAA 


GGCTGGAAAA 


AACGCCAGCT 


TTTCACAGAA 


TACTACACTG 


CAAACTTACA 


8460 


CGTGGGAGCC 


TTTATGTTGC 


CCAAGTATGT 


TGAGGACATT 


TTAGAAGAAG 


AGGAAGGAAA 


8520 


AAAATGAGTC 


GTTTACTAGT 


TATTGGTTGT 


GGGGGCGTTG 


CCCAAGTTGC 


TAT TT C AAAG 


8580 


ATTTGTCAAG 


ATAGCGAAAC 


ATTTACAGAG 


ATTATGATTG 


CTAGCCGTAC 


CAAGTCAAAA 


8640 


TGCGATGACT 


TGAAAGCGAA 


GCTAGAAGGC 


AAAACAAGTA 


CTAAAATTGA 


AACTGCAGCA 


8700 


CTTGATGCTG 


ACAAGGTTGA 


AGAAGTGATT 


GCCCTGATTG 


AAAG CT AC AA 


ACCAGAAGCT 


8760 


GTTTTGAATG 


TAGCTCTGCC 


TTATCAAGAT 


TTAACCATTA 


TGGATGCTTG 


TTTGGCAACA 


8820 


GGTGTTCACT 


ATATCGATAC 


AGC CAACT AC 


GAAGCAGAAG 


ACACAGAAGA 


CCCTGAGTGG 


8880 


CGTGCTATCT 


ACGAAAAACG 


TTGTAAGGAA 


CTTGGTTTTA 


C AGC CT AC T T 


TGACTACTCA 


8940 


TGGCAGTGGG 


CTT AT CAAGA 


G AAAT TC AAA 


GAAGCAGGCT 


TGACTGCTCT 


TCTTGGTTCT 


9000 


GGTTTTGACC 


CAGGTGTAAC 


TAGTGTCTTT 


TCAGCTTATG 


CCCTCAAACA 


CTATTTTGAT 


9060 


GAAATCCATT 


ATATCGACAT 


TTTAGACTGT 


AATGGCGGTG 


ACCACGGTTA 


TCCATTTGCA 


9120 


ACCAACTTTA 


AT C C AGAAAT 


TAATCTCCGT 


GAGGTTTCTG 


CGCCAGGTTC 


TTACTGGGAA 


9180 


GATGGGAAAT 


GGGTCGAAGT 


CGAAGCTATG 


TCTATCAAGC 


GTGAGTATGA 


TTTCCCTCAA 


9240 


GTTGGACAAA 


AAGATATGTA 


TCTCCTTCAC 


CATGAAGAAA 


TCGAATCATT 


GGCCAAGAAC 


9300 
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ATTCCAGGTG TCAAACGCAT TCGTTTCTTT ATGACTTTTG GTCAATCTTA CTTGACGCAC 93 60 

ATGAAATGTC TTGAAAATGT TGGACTCCTT CGTACGGATA CCATTAACTT TAACGGCCAA 94 2 0 

GAAATTGTTC CAATTCAATT TTTGAAAGCC TTGCTTCCAG ATCCTGCCAG TCTTGGGCCA 94 80 

CGTACAGTCG GAAAAACCAA TATTGGATGT ATCTTTACAG GTGTCAAAGA CGGTGTCAAA 9540 

AAG ACT AT C T ATATCTACAA TGTCTGCGAC CATCAGGAAT GTTACGCAGA GGTTGGTTCG 9600 

CAAGCTATTT CTTATACGAC AGGAGTTCCA GCCATGATTG GGACAAAATT AGTCATGAAC 9 660 

GGAACTTGGA AACAAGCTGG AGTGTATAAC CTTGAGGAGT TAGATCCAGA TCCATTCATG 972 0 

GAAGCTTTGA ATGAGTATGG TTTGCCATGG GTTGTGGTTG AAAATCCACA AATGGTGGAC 97 80 

TAATGAAGTT AGAACAAGTA CCAACACCAG CCTATGTTAT TGACTTGGCC AAGTTAGAAG 9840 

CTAATTGCCG CATTCTACAA TATGTACAAG AAGAGGCCGG TTGCAAGGTC TTGCTTGCCC 9900 

AGAAGGCATA TTCCCTCTAC AAAACTTATC CCTTGATTAG CCAGTATCTA TCAGGTACGA 9960 

CAGCTAGTGG ACTCTATGAG GCCAAATTGG CAAGGGAAGA ATTTCCTGGT GAAGTCCATG 10020 

TATTTGCGCC TGCTTTCAAG G AT GC AG ACT TGGAGGAATT G CT AG AG AT A ATGGACCATA 1008 0 

TAGTCTTTAA CTCAGAGAGA CAGTTGCGTA AACACGGTCC GCGTTGTCGA GAGGCTGGTG 10140 

TCAGTGTTGG TTTGCGCCTC AACCCTCAGT GTTCAACTCA AGGcAGATCA CGCGCTCTAT 10200 

GACCCTTGTG CACCAGGTTC TCGCTTTGGA GTTACTATAG ACAAGATTCC GAGTGATTTG 102 60 

C T AG AT TTGG TTGACGGACT TCATTTTCAT ACCCTTTGCG AGCAGGGAGC AGATGATTTA 103 2 0 

CAAACAACTT TGAAAGCAGT AGAAGAACAG TTTGGTCCCT ACTTACATGA GGTAAAATGG 103 80 

CTCAATATGG GTGGTGGTCA T CAT ATT AC A AGAGAAGGTT ACGATGTGGA TTTGCTGATT 10440 

TCAGAAATCA AGCGTATCCG AAAAACTTAC AATCTTGAAA TCTATATCGA GCCTGGTGAA 10500 

GCCATTGCGC TTAATGCGGG TTATTTAGCA ACTGAGGTAT TAGATATTGT AG AAAACGG T 10560 

ATGGAAATCT TGGTTTTAGA CGCCTCTGCG ACCTGCCATA TGCCTGATGT ACTTGAGATG 1062 0 

CCCTATCGTC CACCTTTGAG AAATGGCTTT GAGTCACAGG AAAAAGCCCA TACCTACAGA 10 680 

CTTTCTTCTA ATACCTGTCT GACGGGCGAT GTGATTGGTG ATTATAGTTT TGAAAATCCA 10740 

GTCCAAATCG GAGACAGACT TTATTTTCAA GACATGGCCA TTTATTCTTT TGTCAAAAAT 10800 

AATACCTTTA ATGGTATTGG ATTGCCAAGT CTCTATCTCA TGGACGAACA GGGAGACTGT 10860 

AGCTTACTCA AAGCTTTTGG CTATCAAGAC TTTAAAGGGA GATTATCATG ATGGACAGTC 1092 0 

CAAAAAAATT AGGCTATCAC ATGCCAGCAG AGTACGAACC CCATCATGGT ACCCTCATGA 10980 

TATGGCCGAC TCGACCAGGA TCATGGCCTT TT C AAGG AAA GGCTGCTAAA AGAGCATTTA 11040 
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CT C AG AT TAT CGAGACCATA GCAGAAGGGG AAAGAGTCTA TCTTTTGGTG GAGCAGGCCT 11100 

ATCTATCTGA AGCCCAATCC TATCTTGGAG ACAAGGTTGT TTATTTAGAC ATTCCCACCA 11160 

ATGATGCCTG GGCGCGTGAT ACTGGCCCAA CCATTCTCGT CAATGATAAA GGTAAGAAAT 1122 0 

TAGCCGTGGA TTGGGCCTTC AATGCTTGGG GAGGCACCTA TGATGGTCTT TATCAAGATT 112 80 

ATGAAGAGGA TGACCAAGTA GCCAGTCGTT TTGCTGAGGC CTTGGAAAGG CCTGTCTATG 11340 

ATGCTAAACC TTTTGTACTG GAAGGAGGCG CAATCCATAG CGATGGTCAA GGAACTATTC 11400 

TCGTAACTGA AAGTTGCTTG CTTAGTCCTG GTCGCAATCC TAACTTGACT AAAGAGGAGA 114 60 

TTGAAAACAC AT TAT T AG AA AGTCTTGGTG CTGAAAAAGT TATTTGGCTT CCTTATGGTA 11520 

TTTATCAGGA TGAAACCAAT GAACACGTCG ATAATGTTGC TGCCTTTGTT GGTCCTGCTG 11580 

AGCTTGTTTT GGCTTGGACA GATGACGAAA ATGATCCCCA GTATGCCATG TCAAAAGCAG 11640 

ATCTCGAACT CTTAGAACAG GAAACAGATG CAAAAGGTTG TCACTTCACC ATTCATAAAT 117 00 

TGCCTATCCC TGCAGTTCGA CAAGTTGTGA CAGAAGAAGA TTTGCCAGGC TACATCTATG 117 60 

AAGAAGGAGA AGAAAAGCGA TACGCAGGTG AACGACTAGC AGCTTCCTAC GTAAACTTTT 11820 

ATATCGCCAA CAAGGCTGTC TTGGTTCCAC AGTTTGAGGA TGTAAACGAC CAAGTGGCCT 11880 

TAGATATCCT CAGCAAGTGT TTCCCAGACC GTAAAGTTGT CGGAATACCA G CC AG AG AT A 1194 0 

TTCTCTTAGG TGGTGGCAAT AT C C ACTGT A TCACCCAACA AATTCCAGAA TAGGAGAAAA 12 000 

AGATGAGAAA TGTAAGAGTT G C AAC C ATT C AGATGCAATG CGCTAAGGAT GTGGCAACAA 120 60 

ATATCCAAAC CGCAGAGCGT TTAGTACGTC AGGCTGCTGA GCAAGGAGCC CAAATTATTC 1212 0 

TCTTGCCCGA GTTGTTTGAA CATCCCTATT TCTGTCAGGA ACGTCAGTAT G ACT AC T AC C 12180 

AGTATGCCCA ATCTGTAGCG GAAAAT ACT G CCATTCAGCA TTTTAAGGTG ATTGCTAAGG 12240 

AACTACAAGT TGTTTTACCA ATCAGTTTCT ATGAAAAAGA TGGTAATGTC TTGTATAACT 12 3 00 

CTATTGCCGT CATTGATGCA GATGGGGAAG TGCTGGGCGT TT AT C GAAAG ACCCATATAC 12 3 60 

CAGATGACCA T TAT TAT C AA GAAAAATTCT ATTTCACGCC TGGTAACACT GGTTTCAAGG 1242 0 

TCTGGAATAC TCGCTATGCT AAGATTGGTA TCGGTATCTG TTGGGATCAA TGGTTCCCTG 124 80 

AAACAGCGCG CTGTCTTGCA TTGAATGGTG CTGAATTGCT CTTTTATCCT ACAGCTATCG 12540 

GTTCAGAGCC AATTTTGGAT ACAGATAGTT GTGGTCACTG GCAACGTACT ATGCAAGGGC 12 600 

ACGCAGCAGC GAATATTGTT CCAGTCATCG CAGCCAATCG TTATGGTTTA GAGGAGGTTA 12 6 60 

CTCCTAGTGA GGAAAATGGC GGACAGAGCT CCAGTCTTGA CTTCTACGGT TCCTCCTTTA 12720 

TGACGGATGA AAC AGGAGC T ATTCTAGAAC GAGCTGAAAG ACAAGAAGAA GCTGTTCTGT 12780 

TAGCTACTTA TG AC CT AG AC AAGGGAGCAA GTGAACGCCT AAACTGGGGC TTGTTTCGAG 12840 
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AT AG AAG AC C AGAAATGTAT AGACAAATTA CAGATTAGTG TGGGAGAAAT GAGAGATTCA 12900 

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAAGATACT ATGGCATCTA GTAAATCGAT 12 960 

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020 

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13 0 80 

GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGGTAACGG CATCCGCAAG AGATTTGATA 13140 

TAATCGCTGG CATTTCCCAT TGCAATCCCA AGCCCTGCAA ACTGGAGCAT TTCGATATCG 13200 

TTATTAGCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 132 60 

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG CTTCATAAAT GACAGGCTGC 13 320 

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13 3 80 

GTTTGTTCTT TTGTTCCTAA ACACATACCT TGGAACATCC GGAACTTTCC ACTAGTCGCT 13 440 

TCTTCAAGAG AAATTTCAGT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13 5 00 

TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCGT CAAAAAGTGT CAACTGAACA 13 5 60 

TCACTCTTTT CAGCAAGGTC ATAGAGGTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13 62 0 

TCAACTAGAC TCCAATCACT GGTCTGGTGA GTTGAACAAC CGTTGTTAAC AATAATATAT 13 680 

TCGTTCTGGA GGTCAAGCTC CAGTTTTTTG TAGTAGGGGA GGACACCGAA AAGGGGGCGA 1374 0 

C C C GT AC AG A GAACCAGTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13 800 

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT C CAT AT C C AA GGCTAGTAGT 13 860 

TTAATCATAG GTCTTCCTCT TTATCTTTGC TATTATTATA GCATATTTTG GAGAAGAAAT 13 920 

TGATAGAAAG CTTGAGACTA ATTGATTTTA TAGTTTAAGA TGTTTTGATG ACAATTCATG 13 9 80 

ATTTGAAGAG GATATTTCGC AAAGATATGC TATACTATGT TTGTCAATGT TGCAACTAGA 14040 

CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTAGGTATGA GT AG C AG ATT 14100 

ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 

AAATCTAACA AATGAAGAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 142 2 0 

TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTATTA ACTCTGCTGG TTCATGGATT 14 2 80 

TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTGGAGGA GGTAATCATC TTTGCAAAGG 14340 

AAGTGCTGCG AGATTTTAAG TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 14400 

AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 144 60 

TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA T C T AC C AG C A 14520 

AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG AT GG AG AG C G GAGTTCTTTT CTTAGCGGTC 14 580 
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AT C TAT C T AC TGGTTTTTGC AG G AAAG AAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14 64 0 

TTCATCTACC TTTTACTGGG CT AC AT CAT T T C AT AT ATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTG AAT G AAA CGGTAGAAAT GATGGGGAGA 147 60 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14 82 0 

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14872 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTGGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 6 0 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 12 0 

TATCGGCATT T TC T AAT ACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 24 0 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 3 00 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 3 60 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 42 0 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 4 80 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 54 0 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG AT AAAC ATT G ACAATATCTA 660 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 72 0 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGAGC CTGCATTTCC TTTGTTACTC 7 80 

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 84 0 

CG AGAT AC T G ACGAATCACG CCATCACTTG GAT T GAT AAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 9 60 

AGGACTAGCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 102 0 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AAC AG CAT C T 114 0 
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TAGTCTGATA ACCACCTGCA ACCAAGGCTT TATCCAGCTG AATCATGGCA ATCTTGAGAA 12 00 

TATCTGCTGC CGAACCCTGG ATAGGTGAGT TGATAGCAGT TCGCTCCGCA AAACCACGAA 12 60 

TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 132 0 

CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGGAA 13 80 

AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT AC G ACT AATT CCCAAATTAT 144 0 

TAGACAAGCC AAAGTCTGAA AT C C CAT AAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 15 00 

GACGGTCGTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 1560 

AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC TTAGAAATAT 162 0 

GCGCCAAAAC GCGCAATTCA ATCTGTGAAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 16 80 

ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 1740 

GCAAGTTTGG ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC AC AT AGC G AG 1800 

TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAAGCC AATTACATAA GTAGATTGAA 18 60 

TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGAGCAATA GGAGCGAGAC 1920 

GCTCTAAAAC ATCCACTGCT GTCGAATAAC CTGTCTTGGT TTTCTTAGTG TATTCTAGAG 1980 

GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGAGTTG ACATTAAACT 204 0 

CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC TCATTTTCAG 210 0 

CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 2160 

GG AC AAAAG C CAGAGGTTGC TCCATATCAT AAAGAAGCTC TAATTGCCCA TTTTCGCTGA 2 22 0 

GTTTTTCAAG TAAAATAGGC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 22 80 

CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCGTAG AAAGTTTCAT 2340 

CATCAACCAA GTAAGTCTGA C C AT AAAG AC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 2400 

CAGTCGAAAG GAGGTATTTA GCCAAACGGA TGTCAAAAGC AGGCGCCTGC AAATCCACAC 2 4 60 

CAAAACGTTG CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATGTTTTTT 2 52 0 

CTAAGAAATC CTTGAAAATC GGGTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 2 580 

TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGGTAATTC TCACCAAAAA 2 640 

GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTGG TCAACAATAG 27 00 

TAAAATCCAA ACTCTCAGAC AC AT C AG C T G ACG AC AC AT T TAAAGCCTGC TTTAGCTGTT 2 7 60 

TG AAGC C CAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA C TAT AG AC C A 2 82 0 

AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAAT GGTCGCTAGT GTTTTAGACA 2 880 
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AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2 940 

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3 000 

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 3 060 

GATCGATAAA CTGAGCTGGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCGTAAAGG 312 0 

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 3180 

GAAT C AAAT C CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTGCTAGCT 324 0 

TATCCAGCGT CCCAATGATG TCATCCGCCT CATACTGAGC CAGATCATAG TG AC GAAT C C 3 3 00 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3 3 60 

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 342 0 

CAAAAGCCAC C AAAAT AT G A CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 3 4 80 

G AAAAC CAT A AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 354 0 

G AT AC AG CG C AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3 6 00 

TCTTATCCAT ACACCCATTA TAAAGGAAAG AATCAAAAAA TACCATTGGG AAGAGCTAGA 3 6 60 

GCAAGTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG C C AG AG AATT TAGTAAACCT 372 0 

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 37 8 0 

AAAAT ATTTC TAT AAAT T AA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3 840 

T C ACT AT AG A AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATGA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 3 9 60 

GCTGGACTTG CCAGCAAAAA ATGTGACCAT TTTGGAGGGA AGCGATATTC ACGTATTACT 402 0 

CTCCATGCCT TACTCGGTGC AGGATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 4080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 414 0 

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 42 00 

AGGTGATACT C AC T AG AGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 42 60 

TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 43 2 0 

AGGTGAGGTA TTGGCGATTA CCAACAATGG ACAGGAAAAC CATCTGGTTA AG ATGG C ATT 4380 

CTTGGAATTA AAAAATACAG AGAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 4440 

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACTACATATG 4560 

CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4620 

CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG CAGAAAGGAA ACTTTTTGCC 4680 
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TTCCTAGACA TAGTACGCCA AGGTCTTCTG ACTTCTGAGG TTGCCAGCCA GCAATTAGGT 474 0 

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4800 

AAATCCCCTA TGGATATTAT GGAGCCTATT TTGTGTAGAA AAAAAGTCCC ATATGACCTA 4 8 60 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4 92 0 

ACAAAATTAC T AG AC AT T AA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 4980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTGCCC TGAGTGCGGA 5040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5100 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCGATTCA AGTGCTATCA CTGTTCAAAA 5160 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 522 0 

ACAGTTTTAA ATCTAGCTTT ACT AG ATT C A CCGCTACTAT CTATTTATTC GGAAAAAAGA 52 8 0 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5 340 

GTGGAGAAAG TGGTCGTTTT TCATGAATAC GTACGATAGC ATCCCCTAGG AGATGAGCGA 54 00 

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG C AG AT AG AT G GTATCCAAAA 54 60 

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 552 0 

GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTGCCGCAT 5580 

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 564 0 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 5700 

CAATAATAGC GATAGATGTT TTCAAAAATT CTGCCAACTT ACGAGCACGA GTCACCCCTC 57 60 

CATGGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 582 0 

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 594 0 

CG AC AAGTTT TGAAGTGATT GGCTCACGCG CTCTCGCCTT TCTATCCTGA CGTGCATACC 6 000 

CATAGTAAGG CATGACAACA TTGACAGATT CTGCACTCGC ACGCTTCAAA GCATCTACCA 6060 

TAATCAAAAT TTCAAGCAGA TTGTCATTTA CAGGCGAACT AGTTGATTGT AAGATAAAGA 612 0 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGGC 618 0 

GAACACTTGA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 6240 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

AT GT AT AACT TGTGCTTTTC ACAAGATTTT CCATCTACCA TTGTAGCGCT TTTTGCACTA 63 60 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 642 0 
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AAATATTCTA GGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TGATAAAAAC 64 80 

CAT AG AG AG C AAT AG CCGT A ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6540 

CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

GTTTCTTGCC AACCTTCTGT CCTTTATAGG CTCTGTAAAG CAAGTAAACA C C T AC T AC AG 66 60 

CTAGACCTGA AAAAGT C C AG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 672 0 

AAATAGTAAA GTCAAAGCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 67 80 

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AG ATT TT AAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGGTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 6960 

AAT C AT GG AT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7 02 0 

ATAAAGACAG AATCAAAACC TTCTTTGGCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7 080 

ATAAAGAGCC GG AAT CATGT AAAAGAGAGC TAGAAAGATA GAAACTCCCA T T AGT C CAT A 714 0 

TTCCGCACCT GACCTCGAAC CGTCCGTATA GTAGATGGTT TCATACTGTA AACCAATACA 72 0 0 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC ATACACTTTC TTTCTAAATG 72 60 

AAGT AT TT AT AATTCTACGA CTGTCATACT TCCTGTATCA ACATTGTAAA TGGCACCAGA 73 2 0 

GATAATGACA TCGTCTGGTA TTAGGGGAGA CTCGATAAGC AG T T G CAT AT CCTCGCGTAC 73 8 0 

ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT G AC AC AT C G A CACCCAATTC 7440 

TTCCTTCAAA TACTCCTGAA AAGGTTCATT TTCAAAGGTC TGAGCACCAC AGTCTGTATG 7 500 

AT GC AAT AC C ACAATTTCTC TTGTCCCCAT TTGTTGCTGG GAAATAACTA GAGAACGAAT 7 5 60 

CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 7 62 0 

ACCTAGAGCT TGCGCAACGT GCAAACGTGA GTCCATACAG GTCACAATGG CTACTCTGGT 7 680 

TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 77 40 

C AT AAACTG T TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 7 800 

TCCTATCTTA TCATTTTTAA GAG AAT T T GT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7 8 60 

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7 92 0 

GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 7 980 

CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCGA CAAAACATTC 8040 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 8100 

AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AG CC AT AAT C AAGCTAGCAC GGTTAAAATC 82 20 
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AAGTCCTGTC GTAGTACGCT TGGCATTTCC AAACATGGTC GGTGTTACCA AAGCCTGAAC 82 80 

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGGAGGAAC CAGTCGCCCC 8 34 0 

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CCGCTCTCAA 8460 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC ACCGTATCCA CCATATGCTC 852 0 

C AAC AT AC G A GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 85 80 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 8640 

AGACCCCTGC ACCCCTGAAA TCTCAGGAGA CATGATGGTC TGGATGGAAT CAATAATGAG 87 00 

AAAGTCTGGC TGGATACGCT CCACTTCTGC ACGAACACTC TGCATATTGG TCTCTGCATA 87 60 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CGTAGTTTAA TCTGCTGGGC 8820 

AGACTCCTCC C C ACTG AC AT AGAGAACTGT CCCCACTTGG GACAACTGGG TTGAGACTTG 8 880 

TAGGAGAAGA GTTGATTTCC CAATCCCAGG ATCCCCACCG ATAAGGACGA GACTTCCTGG 8 940 

TACCACTCCG CCTCCAAGCA CACGGTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000 

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GGGCTTGGTT TTCTCACCTG TCAAGGACAC 9 060 

ACGCGCATTC TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 912 0 

GTTGGGGCAA CGTCCCAGAT ATTTAGGGGA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC T AACT C AC AC TCAATCACTT 924 0 

GGCAAAAATC AATCTTCTCA TTTGGCACAA ACTGGCGCAT GAGCATTCGA TGAGCAACAA 93 00 

CTACCACAGT CTGATGTTCT CGATACTTAG AC AT AC AT T C TAGAAACCGA GACTTCATTT 9 3 60 

CCGTAGCTGT CTCATATTGA ATAGGACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 94 2 0 

ACAGTCTTCT AGCTGTTTCA AAGTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 9480 

ATAAAGGCTC T ACTC T T AAA GGAAGACCCG TAGCACAGAC CACATACGAA GCCGTTTCTA 9 54 0 

AAGCTCTTGT GACTGCAGAA GATACGATTA TTTCAGCTGA CGAGAGTAAA GGATTTTTGC 9 600 

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TGCCAAATCT ATCCCAAATC 9660 

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 972 0 

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACGCCATC AGCTGCATCT 97 80 

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9840 

ACAACCTCTT GGTCTGTGAT ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900 

TTTCCATAAT AATCCCCATC AATGACTCCA ACTGAGTTAA TTAAAACCAA GCCCTTCTTA 9960 
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CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA 1002 0 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 1014 0 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CT AG ATT ATT C TAT AC T TAT TTA 1022 3 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATGG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 12 0 

CCTCTTACTG AT AC AG C AGC TGGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCGATCCA 24 0 

GCACCACAAA CTGGACAAGA TCGTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 3 00 

ACTGAAACTA AGGCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 3 60 

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 42 0 

TCTGAAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACGGC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 54 0 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 660 

ACTGTTCGCG TCAACTACTA C CG C AC AG AT GGCAACTATG ACAAGAAATC TCTCTGGTAC 72 0 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 7 80 

ACAGGCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 84 0 

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTT C AC AG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 9 60 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 102 0 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TAT C C TC AAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 114 0 
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GAAGCTGGTA AGAAAGTGAC CTACAGCGGA GATTTCTCTG ACACAAAACA TCCTTATACT 12 00 

GTTAGCTACA ATTCCGACCA ATTCACTACC AAAACAAGCT GGCGCCTGAA AGATGAGACA 12 60 

TACAGCTATG ATGGCAAACT GGGAGCTGAC CTAAAAGAAG AAGGAAAACA AGTTGATTTG 132 0 

ACCCTTTGGT CACCAAGTGC TGATAAGGTT TCTGTTGTTG TCTACGACAA GAATGACCCT 13 80 

GACAAAGTAG TTGGAACTGT CGCTCTTGAA AAAGGGGAAA GAGGAACTTG GAAACAAACT 144 0 

CTAGACAGCA CAAACAAACT CGGAATCACA GATTTCACTG GCTACTATTA TCAATACCAA 1500 

ATCGAGCGTC AAGGTAAAAC TGTTCTTGCA CTCGATCCTT ACGCTAAATC TCTTGCTGCT 15 60 

TGGAATAGCG ACGATTCCAA GATTGACGAT GCCCATAAAG TGGCTAAAGC CGCCTTTGTA 162 0 

GATCCAGCTA AACTCGGACC TCAAGACTTG ACTTATGGTA AGATTCACAA TTTCAAGACT 1680 

CGTGAAGACG CCGTTATCTA CGAAGCTCAT GTGCGTGATT TCACTTCAGA TCCTGCCATT 1740 

GCAAAAGACT TGACCAAACC ATTTGGGACT TTTGAAGCCT TCATTGAAAA ACTAGACTAT 18 00 

CTCAAAGACT TGGGTGTAAC C CAT AT C C AG CTCCTTCCAG TCTTGTCTTA CTACTTTGTC 18 60 

AATGAATTGA AAAACCATGA ACGCTTGTCT GACTACGCTT CAAGCAACAG CAACTACAAC 192 0 

TGGGGATATG ACCCTCAAAA CTACTTCTCC TTGACTGGTA TGTACTCAAG CGATCCTAAG 1980 

AAT C C AG AAA AACGAATCGC AG AAT T T AAA AACCTCATCA ACGAAATCCA CAAACGTGGT 2 040 

ATGGGAGCTA TCCTAGATGT CGTTTATAAC CACACAGCCA AAGTCGATCT CTTTGAAGAT 2100 

TTGGAACCAA ACTACTACCA CTTTATGGAT GCCGATGGCA CACCTCGAAC TAGCTTTGGT 2160 

GGTGGACGCT TGGGGACAAC CCACCATATG ACCAAACGGC TCCTAATTGA CTCTATCAAA 222 0 

TACCTAGTTG ATACCTACAA AGTGGATGGC TTCCGTTTCG ATATGATGGG AG AC C ATG AC 2 2 80 

GCCGCTTCTA TCGAAGAAGC TTACAAGGCT GCACGCGCCC TCAATCCAAA CCTCATCATG 2 340 

CTTGGTGAAG GTTGGAGAAC CTATGCCGGT GATGAAAACA TGCCTACTAA AGCTGCTGAC 24 00 

CAAGATTGGA TGAAACATAC CGATACTGTC GCTGTCTTTT CAGATGACAT CCGTAACAAC 2460 

CTCAAATCTG GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 2 52 0 

GTCAACACCA TCTTTAAAAA TCTCATTGCT CAACCAACTA ACTTTGAAGC TGACAGCCCT 2580 

GGAGATGTCA T C C AAT AC AT CGCAGCCCAT GATAACTTGA CCCTCTTTGA CATCATTGCC 2 64 0 

CAGTCTATCA AAAAAGACCC AAGCAAGGCT GAG AAC T ATG CTGAAATCCA CCGTCGTTTA 2700 

CGACTTGGAA ATCTCATGGT CTTGACAGCT CAAGGAACTC CATTTATCCA CTCCGGTCAG 2 7 60 

GAATATGGAC GTACTAAACA ATTCCGTGAC CCAGCCTACA AGACTCCAGT AGCAGAGGAT 2 820 

AAGGTTCCAA ACAAATCTCA CTTGTTGCGT G AT AAGGAC G GCAACCCATT TGACTATCCT 28 80 
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TACTTCATCC 


AT G AC T C TT A 


CG ATT CT AG T 


GATGCAGTCA 


ACAAGTTTGA 


CTGGACTAAG 


2940 


GCTACAGATG 


GTAAAGCTTA 


TCCTGAAAAT 


GTCAAGAGCC 


GTGACTATAT 


GAAAGGTTTG 


3000 


ATTGCCCTTC 


GTCAATCTAC 


AG ATG C C TT C 


CGACTTAAGA 


GTCTTCAAGA 


T AT C AAAG AC 


3060 


CGTGTCCACC 


TCATCACTGT 


CCCAGGCCAA 


AATGGTGTGG 


AAAAAGAGGA 


TGTAGTGATT 


3120 


GGCTACCAAA 


TCACTGCTCC 


AAACGGCGAT 


ATCTACGCAG 


TCTTTGTCAA 


TGCGGATGAA 


3180 


AAAGCTCGCG 


AATTTAATTT 


GGGAACTGCC 


TTTGCACATC 


TAAGAAATGC 


GGAAGTTTTG 


3240 


GCAGATGAAA 


ACCAAGCAGG 


ACCAGTCGGA 


ATTGCCAACC 


C G AAAG G ACT 


TGAATGGACT 


3300 


GAAAAAGGCT 


TGAAATTGAA 


TGCCCTTACA 


GCTACTGTTC 


TTCGAGTCTC 


TCAAAATGGA 


3360 


ACT AG C C ATG 


AGTCAACTGC 


AGAAGAGAAA 


CCAGACTCAA 


CCCCTTCCAA 


GCCTGAACAT 


3420 


CAAAATGAAG 


CTTCTCACCC 


TGCACATCAA 


GACCCAGCTC 


CAGAAGCTAG 


ACCTGATTCT 


3480 


ACTAAACCAG 


ATGCCAAAGT 


AGCTGATGCG 


GAAAATAAAC 


CTAGCCAAGC 


TACAGCTGAT 


3540 


TCACAAGCTG 


AACAACCAGC 


ACAAGAAGCA 


CAAGCATCAT 


CTGTAAAAGA 


AGCGGTTCGA 


3600 


AACGAATCGG 


TAGAAAACTC 


TAGCAAGGAA 


AATATACCTG 


CAACCCCAGA 


TAAACAAGCT 


3660 


GAACTTCCAA 


ATACAGGAAT 


CAAAAACGAA 


AACAAACTCC 


TATTTGCAGG 


AATCAGCCTC 


3720 


CTTGCGCTCC 


TTGGTCTCGG 


TTTCTTACTA 


AAAAATAAAA 


AAGAGAACTA 


AACTAGCCCT 


3780 


CCTATAGAAA 


AATCCCCCAA 


GCATTATAGC 


TCGGGGGATT 


AATTTTTGTA 


CAATATTTGT 


3840 


TGTCCTAATA 


AACTTGATTA 


GGATTTTTTA 


TTAAGCCTCT 


TTCATAGCAA 


AATAAGCTCG 


3900 


TACTTTGGGT 


GCAACTTGTG 


TTCCGAAGAG 


TTCAATAGCT 


CTCAGAACCT 


GGTCATGAGG 


3960 


CATAGAACCA 


AGCGGTAGAT 


GAAGCATGAA 


GCGGTCCAAT 


CCTAAATCCT 


CTATCATGCG 


4020 


AATCAATTTT 


TCGGCCACCT 


GATCTGGATT 


GCCAACAAAC 


ATGGCGCCAT 


TTGGCCCTAC 


4080 


CTGCTCCAAA 


TATTGCTCAT 


AACGCAATTC 


CTGCCAGTGC 


GGACGGTCTT 


TGGAAATAGC 


4140 


ATCCACCACT 


TGCTTAGTCG 


GATGGAAATA 


ATCTTTCACC 


GCCTGCTCAC 


CATCTTCCGC 


4200 


AATCCACCCC 


CAAGAATGGG 


CTCCCACTTT 


CAAGTCTTTG 


TCAGCATGGC 


CCCTTCGCTT 


4260 


CCAATCTCAC 


G AT AAGCC TG 


AATCAACTTT 


TTAAAATAAC 


GTGGATTACC 


ACCAATAATA 


4320 


GCATATACAA 


TCGGTAGACC 


AGCCTGAGCA 


ATCTTCACTG 


TTGATTCGAC 


ATGACCACCT 


4380 


GTAGCTATCC 


ACAAGGGCAA 


TTTGTCCTGA 


ACTGGACGAG 


GATAAACTTC 


TTTACCAGCA 


4440 


ATCGTTTGAG 


TCAATCGACC 


TTGCCAGTCT 


AACTTGGTCT 


TTTCATTGAC 


TAACTGAAGC 


4500 


AAGTCTAATT 


TCT C AT C AAA 


AAGAGAGTCG 


TAGTCTTTCA 


AGTCATAACC 


AAACAGAGGG 


4560 


AAAGATTCCG 


TGAAAGAGCC 


CCTTCCAGCC 


ATAATCTCCG 


ATCGTCCATT 


TGACAAAGCA 


4620 


TCGATAGTGG 


CATACTGTTG 


GAACAAACGA 


ATCGGGTCCA 


TGCTTGACAG 


AATGCTGACT 


4680 
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GCACTGGTCA AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 474 0 

GCTGATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 4800 

TTGTCAGCCA GCTCAATCTC TGCCACCAAC TGGCGAATGC GTTCAGCATG ACTGTAAGTT 4 8 60 

TGTCCAGTCC CTTCAAGCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4 92 0 

GTGATTCTCC T T AT CT AT C T CTGTACTTCA ATTTGAAAAA T T ATT C T AAC ACGAATCTTG 4 980 

AGTACAAGCA ACCGATTTGC T C ATT AG AAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 504 0 

TTCTACCGTT ACTGACTTGG CAAGGTTACG TGGTTTGTCC AC AT CG AGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTGCGTTGG TACGACCATT GAAATTGGTG AGAGGTATGG 5160 

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATCTTTGGCT ACATTCTCTT CTGCGATAGT 522 0 

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 5280 

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

GTGCTTGAGT TCTCCTGCAG CAAAGCCTTC AC AC TGG AT A TAAGAAATCT CTTTGAGTTT 5400 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT CCGATGTAAA AGGCGTTACG 54 6 0 

AGTTGTTTCA AG AAGT T C AC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 552 0 

TTCGATAGAC TGAGCTACGA TTGACAATTC ATGAACCAGG TCAAAGGCTT GCGCTTTAGC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCGA TTTGCGCTGT 564 0 

ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 57 0 0 

GGCTTCACGT GAGAGGGTTG AACCTGGAAC GTTTGTCACT GTTAAGCTTG GAATTCCCAT 57 60 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCAGATT GGCTGATAAA 5 82 0 

GATGAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5 880 

AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 594 0 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 6000 

ATCTGGGTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATGA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 612 0 

ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 6240 

TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG AC C ATGTT AT AGCCTTCTCC 6300 

AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 63 6 0 

GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATGTGA AGGGCTTTTT TGAAGGCTTC 6420 
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AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 6480 

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC AAGGTATTCT TCCTTGATTT CAAGATAGTT 654 0 

CTCAATCACC CCATTATGCA C C AAG AC AAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6 600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TGTCCGATAC CAGTTGTTCC 66 60 

CTCAACACCA GCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TC AC C AAATG 6720 

GTTATCAGCA C CAT CT AGGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 67 8 0 

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA TATACGACAC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGAGT 69 6 0 

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GT AAG C AT AA AAACTCTGAC CGATTGGGAT 7 020 

AATCAGTCAG AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 70 80 

TGATTTCTTC AGACGACGTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 72 00 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 72 60 

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACGGAATTTT TCTTCCCCTT CAACACCCAT 732 0 

ACCATTCTCA GTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 73 80 

GT CAT AAAT C CCTTGCTCAT AAATCTC C C A ACCACGGTGA GAATTGATTT TACGTCCAGG 744 0 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC AT AAC AC GC A AAGGT TG AT A GTAGTTCACA CCAAGGAAGT CCACCGTATT 7560 

ATCACGAATG AGTTCCAACT CTTCCTCTGT AGCATCAGGT AAAAGACCGT GTTCATGCAA 7 62 0 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGGC 7680 

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCAGCA GGATGCTGGC TACGTGGATA 7 74 0 

AGCCGGTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7 800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7 8 60 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAAT AAC C A AATTCTACAG GAACGATGGG 7 92 0 

CTCGTTAAAG GT AATCCAT T GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7 980 

ATAGTCTTCA TAGGCTGAGA CTGTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 804 0 

GGCAAAAGGT AAAT C AAAAT GATAGAGATT GACTAACAGA CGAATTCCTT TAGCCTTAAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAAT CC AC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTG AC C AC T GAATAGAAGT CCGAAAGGCT GTGTGACCAG TCTCTAACAA 82 2 0 
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AAGCTCAATA TCCCGCTCCC AATTTTCATA AAAAGTCGAT GTCTTATCTG AACCAATCCC 82 80 

ATTATAGTAA CGATTTGGCT CCACTTGGAA CCAGTAATCC CAGAGATTGT CTCCCTTACC 8340 

GTCACCAGCT ACACGTCCTT CTGTCTGCGG TCCAGAAGTA GAGGATCCCC AGACAAAATC 84 00 

CTTTGGAAAT CTTAGCATAC ATTTACCTCT TTATCTACTC ATTTCTCCCA TTATACAGAA 84 60 

AAAACAAGGT AAAAACTAGT TACATTTTTT CCTTGTTTTT CTTCTGATTA TAGTTTTTAT 8520 

TTCTTGCTTA GGATTTCAAG CGTTTCAAGC ACGTTATCTG CATGAACCTC AATGGTGTCA 8 580 

CCAGTTGCCT TGATCTTAAC TTCTACAATG CCATCGGCCG CTTTTTTACC AACAGTGATA 8 640 

CGGATTGGAA GACCAATCAA GTCACTATCG CTAAATTTAA CACCGACACG TTCGTTACGG 8700 

TCATCTGTCA AGACTTCATA ACCAGCTCCC ATCAAGCTTG CTTCAAGTTT TTCTGTCAAG 87 60 

GCTTGCGCTT CTTCATCCTT G AC AT TG AC A GTAATCAAAT GCACATCAAA TGGTGCCAAT 8 82 0 

TCTTTAGGGA AATTGATTCC CCAAGCGTAA CGGTATTCAC CTTTTGGCGT TTTGTTAACA 88 80 

AAGAGGCGAG CGTGTTGCTC CATCACTGCT GAAAGAAGAC GGCTGACACC GATACCGTAA 8940 

CATCCCATGA TGATTGGCAC AGCACGACCA TTTTCATCCA AGACATCTGC TCCCATGCTT 9000 

GCTGAATAGC GAGTTCCGAG TTTGAAAATA TGACCGATCT CAATACCACG CGCAAAGTTA 9 0 60 

AGGACACCTT GTCCATCTGG GGAAATTTCA CCCTCACGAA CTTCACGGAT ATCCACATAT 9120 

TCTGCAGTAA AATCACGGCC TGGGTTCACA CCAGTCAAGT GGTAGTCATC TTCGTTAGCA 9180 

CCGACAACTG CATTGCGAAC ATCTTGTACC TTACGATCTG CAATAATTTT AATATTCTCT 9240 

GGCAAACCAA CTGGTCCAAG TGAACCAAAT CCTGCTTGAA CAACATTCGC CACTTCTTCT 9300 

TCGCTAGCAA CGTCAAAGAA ATCTGCTCCC AAGTGATTTT TCAACTTGAC TTCGTTGAGT 93 60 

TGGTCATTTC CAACTAGAAG GGCTGCAACA AGCTCACCAT CTGCAATGTA GAAGAGGGTT 942 0 

TTAATCGTTT GTTCTTCTGG AACATTGAGG AAGGCTGCAA CTTCATCAAT TGATTTAACA 9 480 

TCTGGCGTTG CAACACGAGT AACTTCTTCT TCAGCGACAA CACGGTTGCT TGGTTTGTAC 9540 

TCGTTTGTTG CCATTTCTAA GTTAGCTGCA TAGCTAGACT CACTTGAGTA AGCAATGGTA 9600 

TCTTCACCAG AGACTATCCA TTTGAGCAAT TCTGCCTTGA TTTCTTCTTG CACTTCTGCA 9660 

GGAATTTCGT CAAATGAGGC AACTGACTTG TCCAAGACAA CCCAGCGGTC AAGGTCTGTA 9 72 0 

CGAGCAGATG TAATGGCCAT AAATTCTTGG CTATCCTTAC CACCCATGGC TCCACCGTCA 9780 

CCAATAATAG CCTTGAAGTC T AAAC C ACT A CGAGTGAAAA TACGCTCATA GGCTGCTTTG 984 0 

TACTCATCAT AAAC ACTAT C C AAACT AT C A TAGTTAGCGT GGAAACTATA AGCATCCTTC 9900 

ATGAT AAAC T CACGTGTACG AAGAAGTCCA TTACGCGGGC GTTTTTCATC ACGATACTTG 9960 
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GGCTGAATTT GATAAAGGTT GAGTGGCAAT TGCTTGTAAG ATTTAACAGA ATCACGGACA 10020 

ATAGCTGTAA AGGTTTCTTC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 100 80 

TTTAGTTTGT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 1014 0 

GCACTAAGAA GGGCTGGAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTGGCGC 10200 

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA ATAAACACCT 102 60 

GCTGAAACTT GGCGAACATA ACCAGCACGC AACATAAGAG CATGGCTGAT AACTTGAGCA 10320 

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTGTTTCAT AATATTCCTC 10380 

GAT TAT C T AA AAAAGAGTCG C AT AATGT C A TTCCAAGTCA CAGCAATCAT CAAGACAACC 10440 

ATGATGACCA CTCCGGCCAA GGTGACATAG GTTTCAATTT CTTGTTTCAA TGGTTTGCGG 10500 

CGGATGGCTT C TAGG AT AT T GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 10560 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 1062 0 

CCATTTTTAG CAGCATCACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC 10680 

AAATCTGGTT GG AAAAT C AG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

GCAGTTGTAA AACCACCTAC AAACATGGAT AG AAAAT CTG ACTTAACCCC CGGTTGAACA 10 800 

CCTAGAAGGT AACGACCTTG ACTATCTTTG GGTGTAACAG TGACTTGTTT GTCACTCCCC 10860 

TTT T C AG AAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 1092 0 

GCTTGGATCA AGCTTTCCCA GTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTGTGCC 1098 0 

ATTTCTGGTA CTCCTACCTT GGCCAAGGCA CCTTGGGGCA TG AT AT GG AA CTGATTGGTA 11040 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC GACACCTAAG 11100 

AT AAAATT GT TCATAGGACC T GC AAAAT T G GTAATCAGTT TGCCCCAGAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCAGTAC CATCTGCTTC CACAACCGTT 112 2 0 

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT G AT AAAG AG C 112 80 

TTGTCTTCAA AATCAAACTG GGTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTTTA 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA T C AT C AG C AA GTGTCAAACT AACAGGCGTT 114 00 

CCTGTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 114 6 0 

GGCAAGATTC GAATGGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520 

CCCATACCGA TGGCAAATTC ACGTACTAAA AT C C CTG AT T TCTTGGCAAA GTAGAAGTGA 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAAC C AG AA TAAAGGTTAA AATTCCGAGC 1164 0 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TGGAAATACA 11700 

AGCAACATAC TATCGAAACG AT C C AAAAC A CCACCATGTC CAGGGATAAA TT T C C C AG AA 117 60 
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TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 1182 0 

ATGCTAAAGA AAAT AG C AAA GACTGACATC TTGTAAATTC CATATGGAAG AGCAACTGTA 11880 

CTGTCAACTA TCATAAGGAT AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940 

CCCTCAAGGG TTTTATTAGG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12 00 0 

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAAGGCTAA GAGAGCCTTG 12 060 

TCCAAACCTG CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 1212 0 

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCGTTCCT 12180 

AACATGATTG AAATCAAAAC ACTATAGGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 1224 0 

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GAGGCCCTCC 12 300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 123 60 

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATTGGTAG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12 4 80 

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT AGCTTCCTGC AAGGCCGCTT 12 540 

CGTCAAAATC AGGCCATAAG GTGTCCGTAA AATAAAGCTC ACTATAGGCT CCCTGCCATG 12 600 

GAAGGAAATT GCTCAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT GGGTCTCGTA 12 66 0 

AGTCCTTAGG CAAATGCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATGTCAC 12 72 0 

CTGGGTTGAT TTTGGCATCT AAAAC AT C C T GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12 780 

CAGCACGTCC ACCATAGTTA AG AG C AAAAT TAAGAATCAA TCCTGTGTTG TTCTTAGTCA 12 840 

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGCTTAGG CAGGCGGTCT GTCTCCCCAA 12 900 

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC ATAATTATCA TAAAACTCTA 12 960 

CTGGCAAGTT CATGATAAAC TTGACTTCCT GATCTGGACG GGTCCAGTTT T C CGT AG AAA 13020 

AAG C AT AG AC CGTAATAACC TTGACGCCCA GTTTGTTGGC TGCCTTGGTC ACGGTTTGCA 13 080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTG CATACGTTTT TTAGCCCAAC 1314 0 

GGCCATTGCC AT C C ATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13 200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA AC ATG AT CTT ATTCCTATTC AAAAATCTAT 132 60 

CGTTTCATTA TACCATATTT CCCCATTTTC TTCTATCACT AAGCTATTTA TTCTCAGGCA 13 320 

CCAAGCCCAT TTTTCAAAAA AATAAGCCGC CTGATTGGGC GACTTTATTT TTATAGGGAG 133 80 

ATTATTATGA AAAAGT TTT A GGAGTTTAAG TTAAGGTCTT CTTAACTTAT GAACTTAGTG 1344 0 

TACACTCCCT AGCTTAAAGT TTCCTTAAGT ATTTTTAAAA ATCAAATTTT TCCATTTCTC 13 500 
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CTGCCAATTT TTCTTGGATA AACGTGTTTG ATAGAGTTCC ATTCGGTCTT CATTTTCTAA 13560 

GAAATGAGGA GTTGGACGAA CTTGAAAATT CAAAATATCC T C C AAAC CAT AAGGTACATA 13 62 0 

GAGTTCAAAA TCTAATTCTT CATTCAAGCG CAGTCCAACT GCCGTACACC GTTCTGGATA 13 680 

CTTACTCATA GCATCACGAG AACTGGTATA GGAAGCAGTG TGAGGACTGT GCTGATGCAT 13740 

AT AG AC C TGA TTTTTCAATT CCCACTGGTA CTGAGGAAAA TCCTCTCTCA GCTTTTTCTC 13800 

CAGTAATAAG GTTTCCTCAT AAGAAAAATC TGGATCAAAG AAAAT C AC AT CTATATCTGT 13 860 

TTCATGATCA AAAGGGGATT TGTCTGACAA AAGATTCCAG ATGAAATTTC TGACAGAACC 13 92 0 

TGCTGCCAAC CACGAGTCTT TCAAACCAAG GTCTCGGATG ATCGTCAGAA TGGCCATCAT 13 980 

ATCTGGACTT TCTCTAAAAG CCTCTAAGAT TTCTTGCTTA TTTTTCACTG TATTCATAAC 1404 0 

CTAAGTGCTC AT ATGCC T T A GCAGTCGCCA CCCGTCCAGA CCGTGTCCGC ATGATAAAAC 14100 

CTTTTTGAAT CAAGTAAGGC TCATACATGT CTTCAACTGT CTCACGCTCT TCGGCGATAT 14160 

TCACAGAAAG AGTTCCTAGA CCAACAGGTC CTCCACTGTA CATCTCAATC ATGGTGCGAA 142 2 0 

GGATTTTTTG ATCCACATAG TCCAAACCTT CATGGTCAAC ATCCAGCATA GTCAAAGCCT 142 80 

TATCGGTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 1434 0 

GCTTGAGGAG ACGATTGGCA ATACGAGGGG TTCCACGACT ACGTAGGGCC AACTCAGATG 14400 

CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTGCCGT CCGCTCGACA ATTTCTGTCA 144 60 

AGTCAGCATG AGCATAATAC TCCATATGAC CTGTAATCCC AAAACGTGCC CGTAGTGGAT 1452 0 

TTGAGAGCAT ACCAGCCCGA GTCGTCGCAC CAATCAAGGT AAAAGGAGGC AACTCCAAAT 145 80 

GAACACTGCG ACTGCCTTCA CCAGCCCCAA TCATAATATC GATGTAGAAG TCCTCCATGG 14 64 0 

CACTATAAAG CACTTCTTCC AC TG AC ATGG GTAAGCGATG AATCTCGTCA ATAAAGAGGA 147 00 

CATCTCCAGG CTCTAAATCA TT C AAAAT CG CTACCAAATC ACCCGCTTTT TCGATAACAG 147 60 

GACCAGACGT TTGCTTGAGA TTGACTCCCA GTTCATTGGC AATGACAAAA GCCATGGTTG 14 82 0 

TTTTCCCAAG CCCTGGAGGG CCAAATAAGA GCACATGATC CAGCGCTTCA TCCCGCATTT 14 8 80 

TAGCGGCTTC G AT AAAG AT C TGAAGTTGAT CCTTAACCTT ATCCTGACCA ATATATTCAC 1494 0 

GTAAATACTG AGGACGGAGC GTGCGTTCTA CTAACTCCTC ATCACCCATC ATCT C AT TAT 15000 

CTAAAATTCT ACT C ATGG CT CTATTATATC AAAAAAAACA AG C C AC AAAC AAAAAAGCCA 15060 

CCTGATTGGG TGACTCCTAA GTTTAGCACT TATGTGGTAT AATATTATAC GGCACTTCTA 15120 

CACCGCCTAC GAAAGGAGGT GAGATAGCCC ATGATGGAAT TAGTACTCAA AACTATTATC 15180 

GGACCAATTG TGGTCGGTGT CGTTCTTCGT ATAGTCGATA AATGGCTAAA CAAGGACAAA 15240 

T AGTGT C AAA AAAGACCTCA AGCTTATTTG GTCGTGAGCT TGGGGTCTTT TCTAGCCTAT 153 00 
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GATATAGAAC TAGTACTCAA TTCCTTTTTA T T ATC C CAT A GTTCACGAAT TTTGTCAAAA 153 60 

CT T T AC AT T T TCTTCAACCG CTGTACGACA AGACGGTTAA GATTAAGAGA ACGTTAGGGA 15420 

TTCTATCAAT TT CAT AG AAA TTTTGATTTC GTAAACGAAG AGACAATCTT ACATGTCACT 15480 

TCTCATTTAA TACGCCACTA C T AG AC AAGC AAAATCATTA TTACAGTAGT TCCAGTCCTT 15540 

CAATTAACAG TCACTTACAA TCAAATTGAG TTTGAACTAG CTGAAGCGAC CACAGACCTA 15600 

TTTCTTAGTC AT ATTCGC T A AAAAAATCCC CGCCAAAATC TCAAAAAGTC CCCGCCAATT 156 60 

CCCCGACCAA AATCCGAAAA AT AC CG AAAA AT AT CG AAAA ATTATTTTTA GAATAGTCCC 15720 

AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG GTGGAGTTAA GGGAGATTAT 157 80 

TATGAAAAAG AAAAGT T TAG GATTTTATTA AATAAAGTTA GGAGGTCTTT ATTTAATAAC 15840 

TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT CTAAAATTTT ACTATTTTGC 15900 

AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG GATAATCTCT AGCCCCTCTC 15 960 

ACACCACCGT ACGTGCCGTT TGGCATACGG CGGTTCAACT AACTTTTAAC GCATGTCGTT 16020 

CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC CAGGACTGGT TTTGATATAG 16080 

CACGTTTAAG TACCGACTTC TGAGCTACTA ATTGATAATG GTCGCCCCAG CCAGATACCT 16140 

T ATC TG CT AT CCATTTAGGA ACTCCTAACT TAAGCAATCC CCATAATCGT CTCGATTTCT 16200 

TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG CAAGCGCTCA TCTATGCTGG 16260 

C G AC TAT ACT TTTCATATTT CCCAATGAGC AATAGTTTAT CCATCCTCGA ATAGACAAAT 16320 

TCAGTTGCTC AATACGTCTT GTTAGGTCTA TACTCCATTT CCTCTGTGTT AGTTTCTTCA 163 80 

ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACG GCTTTTCCAA CCATCTGATA 16440 

ATTTCCAGAA CCCAAAACCT AGATATTTCA ACTCTCTTGG TCATGTTTAC TTTCAAACCT 16500 

AGCCGTTTCT CAATAAACGA CTGACTGAAT AC ATC 1653 5 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 

GATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 120 
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AGGTGGTTAC 


CGTCGTATGG 


TTGCGGCTAT 


GGCTGATTTA 


GGATTTTCAG 


GAACTATGAA 


180 


GGCTATCTGG 


GATGACCTCT 


TTGCCCATCG 


TAGTTTTGCC 


CAGTGGATTT 


ATTTGCTGGT 


240 


TTTAGGAAGT 


TTTCCTCTCT 


GGCTGGAGTT 


GGTTTACGAA 


CATCGTATTG 


TTGACTGGAT 


300 


TGGGATGATT 


TGTAGCTTGA 


CAGGGATTAT 


CTGTGTAATC 


TTTGTATCGG 


AAGGTCGAGC 


360 


AAGTAATTAT 


CTTTTTGGCT 


TGATTAACTC 


TGTTATTTAC 


CTTATTTTGG 


CCCTACAGAA 


420 


AGGCTTTTAT 


GGTGAGGTGC 


TGACGACACT 


TTACTTCACA 


GTCATGCAGC 


CAATTGGACT 


480 


TCTAGTTTGG 


ATTTATCAGG 


CACAGTTTAA 


GAAGGAAAAG 


CAGGAGTTTG 


TCGCGCGTAA 


540 


ACTGGACGGC 


AAGGGCTGGA 


CAAAGTATCT 


TTCCATTAGT 


GTGCTTTGGT 


GGTTGGCCTT 


600 


TGGCTTCATT 


TATCAGTCTA 


TTGGTGCCAA 


TCGTCCCTAT 


CGTGATTCAA 


TCACAGATGC 


660 


AACCAATGGG 


GTAGGGCAAA 


TCCTCATGAC 


AGCTGTTTAC 


CGTGAACAGT 


GGATATTCTG 


720 


GGCGGCTACC 


AATGTCTTTT 


CAATCTATCT 


CTGGTGGGGA 


GAAAGCCTGC 


AAATTCAAGG 


780 


GAAATATCTA 


ATTTATCTCA 


TTAACAGTCT 


AGTTGGTTGG 


TAT C AATG G A 


GCAAGGCAGC 


840 


TAAGCAGAAT 


ACTGATTTAC 


TTAACTAGGA 


AAAGATGTTT 


GAAAGTGCTG 


T T T TG AG AT T 


900 


TCGATTAAAA 


CAGATATAGT 


TGATAATCAA 


GGATTTATAG 


TATGAAAAAG 


AGGATCGGCG 


960 


GGTCCTCTTT 


TGTTGTTGAA 


AAGATAAAAA 


ACTCAGTAAC 


CTAGAAATAA 


GACAACTGAA 


1020 


GCTTTACTCT 


ATATTCAATT 


TTTAGGAATG 


AGAAGGTCTA 


GATAAAATTG 


GACAACTTCC 


1080 


TGGTCTGTGA 


AATCTTGACC 


TTTTTTGAGC 


CACCAGGTCA 


ATGTCTCGAT 


AAAGTTGGAC 


1140 


ATGACCAAGT 


GTTGGAGGTA 


AGAAGTAGGC 


AGATTAGGGT 


GGGCTTCTTT 


TAAATTATCA 


1200 


GCTAGCACGG 


AATAGACATG 


GTGTTCTAGC 


TCTTTATGGA 


GTTGACGGAG 


GAAGTAGTCA 


1260 


TTTTTGGAAA 


ATAGCAGACT 


GGTGATATGG 


TCTTGGTTTT 


TATGAAAATG 


GAGAAAGAGG 


1320 


TGGGCGAGGT 


AGTCCTCGGT 


TGAAATGGCT 


TGCTCTCTTT 


CAAAAAGATG 


ATGGAAGAGG 


1380 


TAGCGGCAGA 


GCTGGTCCAG 


AAGAAGCTCC 


TTACTCTCAT 


AGTGACAGTA 


AAAGGTGGAT 


1440 


CGTCCCACAT 


CTGCGAGATC 


AATGAT AT C C 


TGAACAGTAG 


TGGCCTCGTA 


GCCCTTAGCA 


1500 


TTCAAAAGTT 


GTATAAAAGC 


T TG AT AG AT G 


GCTTTTTTGG 


TTTTGCTGAT 


ACGGCGGTCA 


1560 


ATGTTAGTCA 


TATGGACACT 


TAAGGCAAAT 


TGTTCAGAAC 


TGAATAAAGC 


TGACGTTTTG 


1620 


CTTCTATCCT 


TTCTTTGAGT 


TTTAGTGGAT 


AATGAT AATG 


AACAAGGTGT 


TCATAAATCT 


1680 


ATTATAACAA 


AGGAATGAGA 


AATATGAAGG 


CAAAATATGC 


TGTTTGGGTG 


GCTTTTTTCT 


1740 


TAAATTTGAC 


TTATGCCATT 


GTTGAGTTTA 


TTGCAGGTGG 


AGTATTTGGT 


TCTAGCGCTG 


1800 


TTCTTGCTGA 


CTCTGTGCAT 


GACTTGGGAG 


ATGCGATTGC 


AATTGGAATA 


TCAGCTTTTC 


1860 


TAGAAACAAT 


CTCCAATCGT 


GAAGAAGACA 


ATCAGTACAC 


CTTGGGCTAT 


AAGCGGTTTA 


1920 
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GCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GGGCTCTGTT CTAGTCATTT 1980 

TGGAAAATGT CACGAAGATT TTGCATCCGC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 2 040 

TAGGAATTAT TGCGATTACT ATCAATCTGT TAGCGAGTCT GGTGGTTGGT AAGGGAAAGA 2100 

CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 2160 

TTATCCTGAT GGCGATTGTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 2220 

CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGG TCTACACTCA 2 2 80 

AGATTTTCTT GGATGCTGTG CCAGAAGGTC TTGATATCAA GCAAGTAAAG AGTGGCCTGG 2 340 

AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGGACTATG GATGCTTTGG 24 00 

AAAAAAATGC CATTGTCCAT GTTTGTCTAA AAGAAATGGA ACATATGGAA ACTTGTAAAG 2 4 60 

AGTCTATTCG AATTTTCCTA AAAGATTGTG GTTTTCAAAA TATTACCATT G AAATTG AT G 252 0 

CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 258 0 

AGCATCAACA TTAGAAAAAA GTGAAAAATA CTTGGGTACT ATCTTATTTG GAATAGAGTA 2 64 0 

ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 2 7 00 

GATATATGAT TGTTAATGAT AAAAATTTTT CG AT TAG AT A CAAAATGCTT GACTTGGAGT 27 60 

CAACTCAAAG TTATATAATA AG AT AAG T G A GTTAGAATAG CGTGAATTCA GTGAATGAAA 2 820 

TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 2 8 80 

ATACGATTCG GTATTATGAA CGGGTTGGTC TTGTGCCACC GATTACTCGT ACTGCTACTG 2940 

GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA AT TT ATT AAG TGTTTTCGTT 3000 

CGGCGGGTGT CTCTGTAGAT AGTTTAGTTG AC T AT ATGT C GCTCTACCAA AAGGGAGATG 3 060 

AAACGAGAGA GGAGAGGCTT GGTATTTTAG AAGAGGAAAA GCAAAAATTA GAGGAGCGCT 312 0 

TGTCTCAGCT AC AG AC AG CT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 3180 

GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 324 0 

TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGGTT CGTGCGTGCG 3 3 00 

TTTGTGGTTC AGATTTATGG AGGTACCGTA AT C C AG AAAC GAAAGCTGGA CACAAAAATA 3 3 60 

GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AG C C ATT AC G ACGGTGAAAG 342 0 

CAGGTGATTT TGTGATTGTC C CTTT T AC AC ATGGATGTGG TGAGTGTGAT GCCTGTCTTG 3480 

CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 3 54 0 

CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 3 600 

C TG ACT AT AC AGAAGGGATG CTCAAGTCCC TTTTGACTCT TGCAGATGTC ATGCCGACAG 3660 
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GCTATCATGC 


GGCGCGTGTT 


GCAAATGTTC 


AAAAAGGGGA 


CAAGGTTGTT 


GTTATCGGTG 


3720 


ATGGGGCTGT 


TGGTCAATGT 


GCTGTCATCG 


CGGCTAAGAT 


GCGTGGAGCA 


TCACAAATTA 


3780 


TCCTTATGAG 


CCGTCATGAA 


G AC CGT C AAA 


AGATGGCTAT 


GGAGTCAGGT 


GCGACAgcTG 


3840 


TTGTTGCAGA 


ACGTGGTCAA 


GAAGGAATTA 


CCAAGGTGCG 


TGAAATCCTC 


GGTGGAGGAG 


3900 


CAGATGCAGC 


ACTTGAATGT 


GTTGGTACGG 


AGGCTGCTAT 


AGAACAGGCG 


CTAGGTGTTC 


3960 


TTCATAATGG 


AGGGCGTATG 


GGCTTTGTAG 


GAGTCCCACA 


CTATAATAAT 


CGTGCTCTTG 


4020 


GTTCGACATT 


TATGCAAAAT 


ATCTCTGTAG 


CAGGTGGGGC 


AGCTTCTGCT 


ACAACATACG 


4080 


ATAAGCAATT 


T T T AC T AAAA 


GCCGTCCTTG 


ATGGTGATAT 


CAATCCAGGT 


CGCGTCTTTA 


4140 


CTTCAAGTTA 


TAAACTGGAA 


GATATCGACC 


AAGCCTATAA 


AGATATGGAT 


GAACGTAAGA 


4200 


CAATTAAGTC 


TAT G ATTGT A 


ATCGAATAAA 


AAACGAATAG 


GAGTTTTAGA 


ACTCTATTCG 


4260 


TTTTTTATGT 


TATCCTATTC 


TTGATTTAGG 


GTACTTTCTC 


TTAATGTCAG 


TCTGGTTCCC 


4320 


AGCATGGTCA 


GGCTAGGGAT 


TTTCCGACCG 


TGGAGGACTT 


CCTTGTTAAG 


AATATCCATA 


4380 


CCTGCTCGGC 


CCATTTCTTC 


AGTATAAACT 


GTAATACTAG 


AGAGGGGAGG 


AT AG AC CTG T 


4440 


TTGGTCAGAC 


TAGTGTCGTT 


AAAGGAAATG 


AGGCTGACGC 


GAT C TGGC AG 


GCTGATTCCA 


4500 


GCTTCTTGGA 


GGGCACGGAG 


GGCACCGATA 


GCTAAACTAT 


CGCTGGCTGC 


GAAAAATGCT 


4560 


GGCGGAAGTT 


GGTCTCCCAA 


GCTCTGAATG 


GCCTCCTTCA 


TTAAGTCATA 


GCCAGACTGG 


4620 


GCAGTAAATC 


TTCCTTGAAA 


GACCAGTTCA 


TCATGATAGA 


TTCCCCTCGC 


TTG AC TAT AG 


4680 


TTTTTGAAGT 


TTTCTAGACG 


CTTGTCCTGA 


ATGATTTCTT 


CTTGGTCTGT 


TGTTTCTTCA 


4740 


AGGCCTGTTA 


GAATCCCGAT 


ACGGTCCATT 


CCTTGACTGA 


GGAAATAATC 


GACAACCTGT 


4800 


TTC AT AG C AG 


TGTAAAAATC 


CGTGATAATA 


CAGGTATGTC 


CCAGGGAAAG 


TGTATCGCTG 


4860 


TCTAGAAATA 


CAAGAGGCTT 


TTGGTATTCT 


TCAAAGGCAG 


AAATCTGAGC 


TCGACTAAAC 


4920 


TTTCCGATGC 


AGAGAATCCC 


AATCACTTCC 


TCGCTTAGGG 


TAAAAGGGTG 


GTCATTAAAA 


4980 


TAGCGCAAGA 


TATCATAGTC 


CAACTCTTGG 


GCTCTTTTTT 


CTATTCCTAG 


GCGAATCTGG 


5040 


TAGTAGTAGA 


GGTCGTCCAG 


CTCCCCTTGT 


TCGCTGACCC 


ATTGGATAAT 


GGCAATCTTT 


5100 


TGCTTGGGTT 


TGTGGGACTC 


GCCTGTCTTG 


AGGTGCTTGG 


TGTAGCCCAG 


CTCTTCAGCA 


5160 


ACGGT T AAAA 


TACGGTGTCT 


GGTTTCTTCT 


GTAACAGATA 


GGCTCTGGTC 


GCGGTTGAGG 


5220 


ACGCGGGATA 


CGGTCGCGAT 


AGAGACAGAG 


GCTAGCTGTG 


CAATGTCTTT 


TAAGGTAGCC 


5280 


AT AAATCC TC 


CTTGATTAGG 


TTAGTATATC 


ATGTTTTTCT 


TCTTTTTACT 


GATATTTTAC 


5340 


TAAAATTTTA 


GTAAAAAGGA 


TTGACCTTGG 


AAAATTCCTT 


GGATATAATA 


GAAAGAAAAC 


5400 


GATTACACGT 


TAAGATGGCT 


TAACGGACAG 


TCAAAGGAGA 


ATTCATATGG 


CACAACATCT 


5460 
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TACTACTGAA GCCCTTCGCA AAGACTTTCT TGCTGTTTTT GGTCAAGAAG CAGATCAAAC 552 0 

CTTCTTTTCA CCAGGCCGCA TTAATTTGAT TGGTGAACAC ACAGACTACA ACGGTGGGCA 55 80 

CGTTTTTCCT GCTGCTATTT CCTTGGGAAC TTACGGTGCA GCTCGTAAGC GTGACGACCA 564 0 

AGTCTTGCGT TTCTACTCAG CTAACTTTGA GGACAAGGGC ATTATCGAAG TGCCTCTCGC 5700 

TGACCTCAAG TTTGAAAAAG AGCACAACTG GACCAATTAT CCAAAAGGTG TCCTTCATTT 57 60 

CTTGCAAGAA GCTGGGCACG TGATTGACAA AGGTTTTGAT TTTTATGTTT ATGGAAATAT 582 0 

TCCAAATGGT GCTGGCTTGT CTTCTTCTGC ATCCTTGGAA CTCTTGACAG GAGTCGTGGC 5880 

TGAGCATCTC TTTGATTTAA AATTAGAGCG TCTCGATTTG GTTAAAATCG GCAAACAAAC 5940 

AGAAAACAAC TTTATCGGAG TAAACTCTGG CAT T ATGG AC CAGTTTGCTA TTGGTATGGG 6 00 0 

GGCAGACCAA CGTGCTATTT ACCTAGATAC TAATACTTTA GAATACGACT TGGTGCCACT 60 60 

TGATTTGAAG GACAATGTCG TTGTTATCAT GAACACCAAC AAACGCCGTG AATTGGCGGA 612 0 

CTCTAAATAC AATGAACGTC GTGCTGAGTG TGAAAAAGCA GTGGAAGAAT TGCAAGTTTC 6180 

CTTGGATATT CAGACTCTGG GTGAATTGGA CGAGTGGGCC GTTGACCAAT AT AG CT AT C T 624 0 

GATTAAAGAT GAAAATCGTT TGAAACGTGC TCGCCATGCT GTGCTTGAAA ACCAACGTAC 63 00 

CCTCAAAGCT CAAGTAGCAC TCCAAGCAGG AGATTTGGAA ACATTTGGAC GCTTGATGAA 63 60 

TGCGTCACAC GTTTCTCTGG AGCATGATTA TGAAGTAACT GGTTTGGAAT TGGATACCCT 642 0 

TGTTCACACA GCTTGGGCAC AAGAAGGAGT TCTCGGTGCT CGTATGACAG GGGCTGGTTT 6480 

TGGTGGCTGT GCcATTGCCT TGGTTCAAAA AGATACTGTT GAGGCCTTTA AGGAAGCTGT 6540 

AGGCAAACAC TACGAGGAAG TAGTTGGATA CGCTCCAAGC TTCTATATCG CTGAAGTTGC 6 6 00 

AGGTGGC AC T CGCGTCCTTG ACTAGTCAAA AGGAGGCTCT AT AGTG AC CT TAGTAAATAA 6 660 

ATTTGTAACA CATGTCATTT CTGAAAGCTC ATTTGAGGAA ATGGATCGAA TCTATCTGAC 672 0 

CAATCGTGTT TTGGCACGAG TGGGAGAAGG TGTTTTGGAA GTTGAGACCA ATCTGGATAA 67 8 0 

ATTGATTGAC CTCAAGGACC AGCTGGTTGA AGAAGCCGTT CGATTAGAGA CGATTGAGGA 6840 

TAGTCAGACT GCGCGTGAAA TCCTTGGTGC TGAACTGATG GATTTGGTGA CTCCTTGTCC 6900 

AAGTCAGGTC AATCGTGATT TTTGGGCAAC CTACGCCCAC TCTCCAGAAC AAGCGATAGA 6960 

GGATTTTTAC CAACTCAGTC AGAAAAATGA CTACATCAAA CTCAAGGCCA TTGCTAGAAA 7020 

TATCGCTTAT CGTGTTCCAT CTGACTACGG AGAACTTGAA ATTACCATCA ATCTCTCTAA 7 080 

GCCTGAAAAA GATCCCAAAG AGATTGTGGC AGCCAAGTTG GTGCAAGCTA GTAATTATCC 7140 

TCAGTGTCAG CTTTGTCTAG AG AAT GAGGG CTACCATGGT CGAGTTAACC ACCCAGCTCG 72 00 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAG GAATGGGGTT TCCAGTATTC 7 2 60 

GCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT GGCCAGCATC GTCCCATGGC 7 32 0 

CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCGTA GACCAGTTTC CAGGATATTT 73 80 

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT ATTCTAACTC ATGATCATTA 7440 

TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC TTGCAAAAGG CCTTCCGATT 7 500 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACGTTT 75 60 

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTGC AGGAATGGCG 7 62 0 

CCAGTATTCA GATCCTGCAG TGCAGATTTT GG C AG AG AC A GACAGGACAC CGCATCACAC 7 680 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG TTGGACTTGG TCTTGCGAGA 7 74 0 

CAATCAGACT TCAGCAGAGT ATCCTGATGG TAT C TAT CAT CCCCACAAGG ATGTCCAACA 7 8 00 

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC TTGGCAATCT TGCCACCACG 78 60 

TCTGAAAGAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7 92 0 

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCAGACT AACGGATAAA 7980 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGGGTGCTA TCTTTGCGCG TGTACTTGAG 8 040 

GATGCAGGAG TCTACAAGCA GACAGAACAA GGG C AG AC AG CCTTTATGCG CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGG 813 6 



(2) INFORMATION FOR SEQ ID NO : 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 76: 

CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT TTCGTCCACC 60 

TTTTGCGTGT TTAAGTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT TCAAAAGTCG 12 0 

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 18 0 

CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 24 0 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 3 00 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 3 60 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 4 80 
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GCAAGAATTT 


TAATCATTTG 


GATTTCCTCC 


GATTTTATTT 


TTTAATAGAC 


AAGATTAAGC 


540 


GGTTGCTTCA 


GCAATGTAAG 


CATAAAGGGC 


TTCTGGTTCA 


GAAATTTTTG 


ATAGGTCTTC 


600 


AAGATGACCA 


TTTCCTGTGA 


AGAAGTCCAT 


TAACTGAGCA 


AGAATGTTCG 


TTTGACTTGA 


660 


ACTTGAATTA 


TTGATGATAA 


AGAAGAGCAA 


GGATACTTCT 


ACTTCCTTAC 


CTGGCGCAAT 


720 


CATATTATGG 


AAAGTCACCG 


GTTTCTCTAA 


TCGAACAACC 


ACCACTTTCT 


CAGCTAGATT 


780 


ATGAACAATA 


TCTGTGTGAG 


GAATCATTAC 


ATTTGCAAGT 


CCTTTCCTAG 


AAAT TC CAT A 


840 


TATAAACCAG 


TTGGAAATGA 


CTTTTCACGC 


GTGATCAAGG 


CTTCACGATA 


AGTTGGAGTG 


900 


ACAATTTCTC 


GTTCTTCCAA 


CAAGCTTGCT 


ACCTGATCAA 


AAAGTTATTC 


TTGATTATCC 


960 


GCTTCTAAGC 


AAAACACAAG 


GTTTTTGTCA 


AAGAAATAAT 


CT AAT AC CAT 


AAGGTTTTCC 


1020 


CTTCTTTCCA 


TTAACTTTAT 


GCTATAAGTA 


TAACACTATA 


TGAAATCGTT 


GTT AAT TACT 


1080 


TTCTATTCTT 


TTTTGTCTCT 


TTTTTTATAT 


TTTTGTTTTG 


TTTATAGTTT 


GTTATATAAA 


1140 


AATAAACACA 


CAAACAAATA 


CTCCAAGCAT 


TTTTCTGTTC 


TAATACTCAA 


TGAAAATCAA 


1200 


AGAGCAAACT 


AGGAAGCTAG 


CCGCAGTTGT 


TCAAAACACA 


GTTTTGAGGT 


TGTAGATGAA 


1260 


ACTGACGAAG 


T C ACT C AAAA 


CATGGTTTTG 


AGGTTGTAGA 


TGAAACTGAC 


GAAGCAACAg 


1320 


CCATACATAC 


GGTAAGGCGA 


CGCTGACGTG 


GTTTGAAGAG 


ATTTTCGAAG 


AGTATAAAAA 


1380 


CTAAAAAAGC 


AGACCATCTA 


AGCCTGCTTT 


ACTATTGATT 


CTTATATAAA 


TTTCCTGTGA 


1440 


ACAAGGAAAG 


GCATTTCTGA 


TAACTTATTC 


TTCATCCATA 


CTCAAGACGC 


TGAGGAAGGC 


1500 


TTCTTGCGGA 


ACTTCAACTG 


ATCCGATGGA 


TTTCATGCGT 


TTCTTACCAG 


CTTTTTGTTT 


1560 


TTCAAGGAGT 


TTACGCTTAC 


GAGAAACGTC 


ACCACCATAA 


CAT T TAG C AA 


GTACGTTCTT 


1620 


ACGAAGGGCC 


TTGATATCAG 


TACGAGCGAC 


AATCTTGTGT 


CCAATAGCCG 


CTTGGATTGG 


1680 


AACTTCAAAT 


TGTTGGCGAG 


GGATGATTTT 


CTTGAGTTTA 


TCAACGATGA 


GTTTCCCACG 


1740 


TTCGTAGGCA 


AAGTCCTTGT 


GAACGATAAA 


GCTGAGGGCA 


TCCACCTTAT 


CTCCATTGAG 


1800 


AAGAATATCC 


ATTTTCACCA 


GCTTAGATGG 


G CG AT AT T CT 


GACAATTCGT 


AGTCAAAGCT 


1860 


TGCATAACCA 


CGTGTCGAAG 


ACTTAAGTTT 


ATCAAAGAAG 


TCAAAGACAA 


TTTCAGCAAG 


1920 


AGGAATTTGA 


TAGATAACAT 


TGACACGGTT 


ATCATCAATA 


TAGTCCATAG 


TCACAAAGTC 


1980 


CCCACGCTTA 


CGCTGAGCTA 


GCTCCATTAC 


TGCTCCGACG 


AACTCCTGTG 


GTACCATGAT 


2040 


TTGCGCCTTG 


AC AT AAGGC T 


CTTCAATGGT 


CGCAATCTTA 


GTTGGGTCTG 


GAAACTCAGA 


2100 


TGGGTTAGAC 


ACATCCATAG 


ACTCACCGTC 


GGTCAAATTA 


ACTTTGTAAA 


TAACAGACGG 


2160 


AGCTGTCATG 


ATGAGGTCAA 


TATTGAACTC 


ACGCTCTAAA 


CGTTCCTGGA 


TAACATCCAT 


2220 
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ATGGAGAAGT 


CCAAGAAATC 


CACAACGGAA 


ACCAAATCCA 


AGTGCCTGAG 


ATGTTTCTGG 


2280 


TTCAAACTGA 


AGACTAGCAT 


CATTCAGTTG 


CAATTTTTCA 


AGcGCTTCAC 


GCAGGTCATT 


2340 


GTACTTGTTT 


GATTCGATTG 


GGTAGAGACC 


CGCAAAGACC 


ATAGGATTCA 


TCTGCTTATA 


2400 


ACCATGTAAT 


GGTTCTGCCG 


CAGGATTGGT 


TGCCAAGGTA 


ACGGTATCAC 


CCACACGAGT 


2460 


ATCCTGAACC 


G T CTTG AT AG 


ACGCCGCAAT 


GTAACCAACA 


TCACCAGTCG 


CAAGGAAATC 


2520 


ACGACCAACC 


GCTTTTGGTG 


T AAAAAT AC C 


GACTTCGGC C 


ACATCAAAGG 


TCTTACTATT 


2580 


GCTCATGAGC 


TGAATCTTAT 


CACCAGGTTT 


GACCACTCCG 


T CC AT G AC AC 


GCACTTGGAG 


2640 


GATAACCCCA 


CGGTAAGCAT 


CGTAAACAGA 


GTCGAAAATC 


AAGGCCTTAA 


GTGGCGCCGT 


2700 


CACATCACCC 


GTTGGTGCTG 


GTACTTTTTC 


TACAATTTGC 


TCGAGGATTT 


CTTCAATCCC 


2760 


AATACCAGCC 


TTGGCAGAAG 


CCAAAACTGC 


TTCACTGGCA 


TCCAAACCAA 


TCACATCTTC 


2820 


AATCTCTGTA 


CGCACGCGCT 


CCGGATCTGC 


AGCCGGCAGG 


TCAATTTTAT 


TAATGATAGG 


2880 


CATGATTTCC 


AAATCATTAT 


CCAAAGCCAG 


ATAAACGTTG 


GCAAGAGTTT 


GAGCCTCAAT 


2940 


TCCTTGAGCC 


GCATCGACCA 


CCAAAATAGC 


ACCCTCACAG 


GCAGCTAGCG 


AACGTGAAAC 


3000 


TTCATAGGTA 


AAGTCAACGT 


GCCCTGGTGT 


GTCAATCAAG 


TGGAAAATAT 


AAGTTTCCCC 


3060 


ATCTTTTGCA 


GTGTAATTCA 


ACTCGATGGC 


ATTCAACTTA 


ATAGTAATTC 


CACGTTCCCG 


3120 


CTCTAGCTCC 


ATGCTATCCA 


AAAGCTGGGC 


CTGCATTTCA 


C G ACTTG AAA 


CCGTCTCTGT 


3180 


TTTTTCCAAA 


ATGCGGTCTG 


CTAGAGTTGA 


TTTTCCGTGG 


TCAATATGGG 


CGATAATAGA 


3240 


GAAGTTACGG 


ATCTTCTCCT 


GTCGTTTTTT 


CAATTCTTCT 


AAGTTCATGA 


TTCTCTTCCT 


3300 


TTCAGGGTAT 


CTATTTATTA 


TAAATTGTTT 


TTGATATTTT 


G AC AAG AC C A 


TACCCTGCTA 


3360 


GGAGTACTAA 


TCTTCAGCGA 


CAAAGCCGTC 


ATTTTCGATA 


AAGTGGTGTT 


CTGTCATTCC 


3420 


TTGGTCTGTA 


AAGACAATCC 


CGTGAAGGAC 


ACCACCATAA 


ACAGCTCCTC 


CATCCATTCC 


3480 


AATCTTGCCA 


TCTTCTGTAG 


TCCAAAGCTC 


AGATGTACCG 


CGTTCTTGCT 


GTAACAAACC 


3540 


ATAGACCGGT 


GT AT GACCG A 


AGACAATGGT 


TTTTCCAGTA 


TGATTTTCAG 


CTCCGTGGAA 


3600 


TGGTTTTCTA 


AGCCATACTT 


TTTTATAATC 


TGTTGTTTCA 


TGCCAGTCGT 


CCAAGGTCAA 


3660 


ATCAATACCT 


GCGTGAACAA 


AGATATACTT 


GTCTGTCTCT 


AC T AC AAATG 


GCATTTGACG 


3720 


AATGAATTCG 


ACCAAGTCTG 


CCGCTTCAGC 


GgCAACCCGC 


TTGGCATCTT 


CTACTCCATC 


3780 


AACTGGTGCA 


TCCAAGGGAC 


GACCTAGGAT 


AGAGTTAATG 


GTTGTATCTC 


CACCATTGCG 


3840 


ACTATAATGG 


TCATAACTTT 


CTTCTGGGTC 


ATCTAGCCAA 


GT C AAAAAC A 


TATACTCGTG 


3900 


GTTTCCGGAC 


AAACAGATAG 


CCCCTTGATT 


GTCCACCAAG 


TCCTTGACCA 


TTTCAAGAAC 


3960 


ACGGTGACTA 


TCCTCACCTC 


TGTCAATCAA 


ATCACCTAGA 


AAGAGCAACT 


GGGGCTGACC 


4020 



WO 98/18931 



PCT7US97/19588 



625 



ATCCCAGGTT 


TTGAGAAGGT 


CTTCCAGCAT 


CCCAGCTTTT 


CCGTGAACAT 


CTCCAATTAC 


4080 


AT AAT AAT C T 


GTCATCTTAT 


TTCTCCCTGT 


TTCTCAACAA 


TTCTCTTGCT 


TGCGTCAGGG 


4140 


CTGCTTCTGT 


CACATCATCA 


CCTGCCAACA 


TCTTGGCAAC 


TTCCTCCACT 


CGCTCTTCGA 


4200 


CCGTCAAGAG 


ACGAACAGTC 


GAAACCGTTG 


AATGGTCATT 


ACTAATCTTC 


TCAATAAAGA 


4260 


ATTGATAATC 


TGCAATCGCA 


ATTACTTGTG 


GCAAATGGGA 


GATAGCCAAA 


ACCTGACCAT 


4320 


GCTGACCAAT 


TTTATGAATT 


TTCTGAGCAA 


T AG CTTG AG C 


AACACGACCT 


GAAACTCCCG 


4380 


TATCCACCTC 


ATCAAAGACA 


ATGCTAGTCT 


TGCCTTCTTT 


ACGTGAAAAG 


GC AG AC T T AA 


4440 


TGGCTAACAT 


GAGACGAGAT 


AATTCCCCTC 


CAGAAGCAAC 


CTTAACCAAG 


GGTTTAAAGT 


4500 


CTTCTCCAGG 


GTTGGTTGAA 


ATATAAAACT 


CAACCATTTT 


ATTTCCCTCA 


CGACTGAATT 


4560 


TTCCCTTACT 


AAAACGAACC 


TGAAACTGGG 


CTTTTTCCAT 


AT AAAG AT CT 


TGCAGTTCTT 


4620 


GTTTAATCTC 


AGCTTCGAGT 


TGCTGAGCCA 


AATTATGACG 


AG C AG AAG C A 


AGTTGACCTG 


4680 


C C AAATTG AC 


AAGATTGACT 


TCCAACTTCT 


TAAGCTCTGC 


TTCCATGTCC 


T C AG AC GAAA 


4740 


GATTATTGCC 


TGTCAAGAGA 


TTGTATTCTT 


CCGTAATCTT 


GGCAAAATAA 


AGCAAAACAT 


4800 


CATCAACAGT 


CCCACCATAC 


TTACGAGTAA 


TAG TAT G AAG 


GAGGTCCAAA 


CGATTCTCAA 


4860 


CCTGCATCAG 


GCGATTGCCA 


TCAAAATCAA 


GGTCCTCAAT 


GAT AG C TT C C 


AAACGTTTGC 


4920 


TAATGTCTTC 


TAAAACATAG 


TAGGTCTCAG 


AC AGAT AG C T 


TGAAATTTCA 


CGGTATTCAG 


4980 


GAT CAT AC T C 


TTCGACACTT 


TCCATGTCAT 


TCATAGCTGA 


ACGAACATTG 


GCCAGACTTG 


5040 


AAAAATCTTC 


ATTGTCCAAC 


ATACTGTAGG 


CATTGGTCAG 


TGTATCCGCA 


ATATTTTTGT 


5100 


GGTTGAGGAG 


TTTATCTCGC 


TCTTGATTGA 


GAGCCAAGTC 


TTCTCCAGCC 


TGCAAGTTTG 


5160 


CTGCCTCAAT 


CTCTGCCATT 


TGAAATTCCA 


ACATTTCGAT 


ACGTGCCTTG 


TGTTCCTGTT 


5220 


GGTTTTTCTT 


GACTTCCAGA 


ACCTGCTTGC 


GCATTTTCCG 


ATAGGCATCA 


AAACTCGTTT 


5280 


GATAGGTTTC 


TTTCAAGTCC 


CAAAAAGCGG 


C ATCAC C AAA 


TTCATCCAAC 


ATCTGGATAT 


5340 


GCAGTTGGGG 


ACGCATTAAC 


TCCTCATGGT 


CATGCTGACC 


ATGAATATCT 


ACAAGATGTT 


5400 


GCCCAATAGC 


TCGCAAAACA 


GACAGATTAA 


CCATCTGACC 


ATTTACACGG 


CTGATACTAC 


5460 


GACCATTTTG 


CAAGATTTCC 


CGACGGATGA 


TAATTTCATC 


ACCTAATTCT 


AAACCTTGCT 


5520 


CATCAAAAAT 


TTCCTGTAAA 


AGACGACTAT 


TCTCAACTGA 


GAAAAGCCCC 


TCAATCTCTG 


5580 


CCTTTGGTGC 


ACCATGACGA 


ATAACATCTG 


TCGTCGCACG 


AGCTCCCAAC 


ATCATATTCA 


5640 


TGGCATCAAT 


GATAATCGAC 


TTCCCTGCAC 


CCGTTTCACC 


AGTCAGGACA 


GTCATCCCCT 


5700 


TTTCAAAATT 


GAGGGAAATA 


GCCTCAATAA 


TGGCAAAGTT 


TTTTATCGAA 


ATTTCAAGTA 


5760 



WO 98/18931 



PCT7US97/19588 



626 



ACATATAGAC 


CTACCAATTT 


TTTACTTGTT 


CAAAGATTTC 


CTCTGCTAGA 


CTTCCACTTC 


5820 


TGGCAATGAC 


TAAAATCGAG 


C TAT CATC AG 


TCAAACAGCT 


AAAAATCTTG 


TCTGCAAAAG 


5880 


T CT C G ATT AA 


CTGAGCTTTT 


AC AAAAG C C G 


TATTTCCTGG 


AATAACTTGG 


AG AT TG ATC A 


5940 


TCTTATCCAT 


CAATTCAGCC 


GATTCGATAT 


TGTCTTCAGC 


CAGTTGCAGA 


CTTTTTACGA 


6000 


TTGATTTTGG 


CAATTCGTAG 


ACATAGGTGT 


TGTCTCTCAA 


AGGAATTTTG 


ACAATACCTA 


6060 


ACTCTTTGAT 


ATCTCGGGAT 


ACCGTCGCCT 


GAGTGGCAGT 


GATACCTGCT 


TCTTTCAAAT 


6120 


GTTCTACAAT 


TTCTTCTTGC 


GTGCCGATTT 


GATAATCTGT 


CACCAATCTT 


CTAATTTTTT 


6180 


CAAGTCTCTC 


TTTTTTATTC 


ATTTTTAAAT 


TGACTATGCG 


CCCTCTCTAC 


TGCTTCTTTA 


6240 


ATCTCAGCAA 


GAATCTGATT 


GCTTGCTGAC 


TTTTCTTTTT 


TCAAATACGC 


T AAAAATT C A 


6300 


ATATTTCCAT 


GTCCACCTTG 


GATGGGAGAA 


AAGTCCAAGC 


CAAGGACTGA 


AAAACCTACC 


6360 


TCTACTGCCA 


TAGCTGTTAC 


AGATTCAAGG 


ACATTCTGAT 


G AAC CT TAG C 


ATCTCGAATA 


6420 


ATTCCATTTT 


TCCCAATCTG 


CTCACGTCCT 


GCCTCAAACT 


GAGGTTTGAC 


AAGTGCTACC 


6480 


ACCTGACCTT 


GATCAGCCAA 


GACACGGTGC 


AAGGCTGGCA 


AAAT C AG ACT 


AAGGGAAATG 


6540 


AAACTCACAT 


CAATACTGGC 


AAAGCTCGGC 


TCCTGCTCGA 


AATCAGTCTT 


TTCAGCATAG 


6600 


CGGAAATTGA 


ACTGCTCCAT 


GCTGACAACT 


CGTGGGTCTT 


GGCGTAATTT 


CCAAGCCAAC 


6660 


TGATTGGTAC 


CAACATCGAC 


TGCAAAGACC 


AACTTGGCAC 


TAT T CTGT AG 


CATGACATCG 


6720 


GTAAAACCTC 


CAGTAGAGGC 


CCCGATATCA 


ATCGTAGTCG 


CGCCATCCAC 


CGACAAATCA 


6780 


AAGACCTGCA 


AGGCCTTTTC 


CAGTTTCAAA 


CCACCACGGC 


TGACATACTT 


GAGTTTCTCC 


6840 


CCCTTGAGTT 


TTAATTCGGT 


GTCATCTGGA 


ATTTTCTCTC 


CTGGCTTGTC 


AAACCGTTCT 


6900 


CCATTAAGGA 


CTGCTACGAC 


TAGGCCAGCC 


ATCACACCTC 


GCTTGGCCTG 


CTCTCTCGTT 


6960 


TCAAACAACC 


CCTGTTTATA 


AGCTAGTACA 


TCCACTCTTT 


CCTTAGCCAT 


TGATTCTCAA 


7020 


ACTTTCTACT 


ACACTTACAA 


TCGATTCTGT 


TTCAAAGGGA 


AGCTGCTGGG 


CAATTTCTTC 


7080 


TAATTTTTCA 


TTAGCTTGAT 


CCAGGGTTTG 


GT T AC AAAAG 


GCAATGGACT 


CTTCCAAGCC 


7140 


CAACAGGGCA 


GGATAGGTTG 


ATTTTTCTGC 


CTGCAGATCC 


TTTTGAGGTG 


TCTTGCCGAT 


7200 


TTCCTCAAAA 


CT AG CTGTC A 


CATCCAGTAC 


ATCATCTCTG 


ACTTGAAAAG 


CAAGTCCAAT 


7260 


CAATTCACCC 


ACAGTTTTCA 


GCTTCACCTG 


CATTTCAGGT 


GACAATTCAG 


CTATAATAGC 


7320 


TGCCGCTTGG 


AAGGGATAGG 


CTAGTAACTT 


CCCAGTCTTA 


TTGGCATGAA 


TAGTCTGAAG 


7380 


TTCTTCCAAA 


GACAAGTGCT 


GGTGTTCGCC 


CTCCATATCC 


AAAACTTGCC 


CTGCTACCAT 


7440 


ACCCAGACTA 


CCTGAAGCAA 


GGGATAAGTT 


GGCAATCAAG 


TCCACCTTAA 


TCTGACTTGG 


7500 


CAAATCTGCC 


TGCGCAATCA 


AGGCATATGA 


GTCTAAGAAT 


AAGGCATCTC 


CAGCCAAAAT 


7560 
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GGCCATAGCT TCACCGAATT TCTTGTGATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7 62 0 

ATCCATAGCA GG AAGGT CAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7 6 80 

AGCTACCTGG GCGTGAGCAG GTTTGATGGT AACCTGCAAG GCTTCCAGAA CTTCTAACAA 77 40 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATGAATAGAA TAGAGAACAG ACTCCCGTAA 7 800 

AC T AG AGG C A AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7 860 

TTTTTCTTGC TTTTTCATTC AAAATCACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 7 92 0 

AAGGTCTTTT CAGCCTTGTC CAGCGTAGCT TGGAGCTCTT TTGACAAGAC CATGCCCTTT 7 980 

T G AAAGGC AG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 804 0 

GTTTCCAGTT CTGCTAGATT TTCCTCAAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TT C T ACT T G A CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGGACTCTT CTTTTTTGAC AATAGCATAA CCACGCGCCA CGATTCGGCT 822 0 

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCGCTTGGCC TCAGCAACCT TGGCGTCATA 82 80 

AACTAACGCC ATTTGGCTAC CTAAGAGCTT GTCCAACTGT CCTAAACGGT CTTGATAGCG 8340 

TTGGATTTTG GTAACAGGTG AT AATT GT AC TAATTGATGA GTTCTTGCTT GAACTAATTG 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTGTTTCAAA CGCAGTTGCA GTTGGTCCAA 84 6 0 

GCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGTTGTCTA AAGATAACAG ACTGACTGCA 8 520 

TTTTTTCAAA GCCTCTTGTT TCTTAGATAG AACATTTCGG ACTGCCGTTA CCATCCGTTT 85 80 

TTCCTGATTT TGCAAATGAG CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCAGC 8 640 

CGCCGCTGTT GGCGTTGCAG CGCGTCGATC TGCCACAAAA TCTGCCAAGG TCACATCCGT 87 00 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACG AG ATT C A AAAATAGCTC GTACCACAAT 87 6 0 

TTCTTCGTTA AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATGAGCAA 8 82 0 

ATCCAAATCG TCCCGTTGAT TAGCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8 880 

TTCACCTTGA ACCTTGGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC GCCTGCTGAC 8940 

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GT T AC T AC AC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TTCTTGAAAC AGGCCTTCTT CTGTCAATTT 9060 

TTTCTTAAGT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAGCTTTTTC 912 0 

AATGATG AT G GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTGAT 9180 

CTTCATTCCT TCTTCCAGGT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 9240 

TTGAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 93 00 
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GTTGGAAACT TGACCAGTTA AATAGACCCG TTCCAAGTAT GGGTCTTTAT CGAATTTCAT 93 60 

TTTCAGATAC TTGGTCAAAG TTGTTACCGA TAAATACTTT TCCATCTCCA CCTACTATTC 942 0 

ATTTACTTGC TCTTTCATGG GTATTATTAT ACCAAAAATA TGCCTAAAAA TCTCCATTTA 9480 

TGT AC C ATT A TGAGGGAAAA ATAGAAAAAG GAGGCAAGGC CTCCACATGT GATTATTTGC 954 0 

TGTTTCGAGC TTCTTCCAAA ATCTTTGCAA TCTTGGTCGT CAACAGGTCG ATAGCCACGG 9600 

TATTGCTAAC CCCTTCAGGA ATGACGATAT CAGCATAACG CTTAGTTGAC TCGATAAACT 96 60 

GGTGGTACAT TGGTTTGACC AC AC CT AAGT ACTGGTTAAT AACGCTATCA AGGCTACGGC 972 0 

CACGCTCCTC CATATCACGC TTGATACGAC GAATAATGCG CACATCGTCA TCCGTATCCA 97 80 

CAAAAATCTT GATATCCATC AAATCGCGCA GACGCTTGTC CTCCAAGACC AAAATACCCT 984 0 

CAACGATAAA GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT GTATGCTCTG 9900 

TATAGTCGTA GGTCGGGATG TCCACCGGAC GCCCTGCCAA CAATTCCTTA ATCTGCTCGA 99 60 

TCATCAAGTC TGTATCAAAG GCAAAAGGAT GGTCATAGTT GGTTTTGACG G 10011 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 6 5 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

CGTGTGGTCT TAAAAATAGA AGACAAAGAA CAAACTGTTG GAGGCTTTGT CCTTGCAGGC 6 0 

TCAGCCCAAG AAAAAACCAA AACAGCTCAA GTTGTGGCTA CTGGACAAGG TGTTCGTACC 12 0 

TTGAACGGTG ACTTGGTTGC TCCAAGTGTT AAAACTGGAG ATCGTGTCTT AGTTGAAGCC 180 

CACGCAGGTC TTGATGTCAA AGATGGCGAT GAAAAGTACA TCATCGTAGG CGAcTAACAT 240 

TTTGGCAATC ATTGAGGAAT AGAAGGAGAA AGTAAGTATG TCAAAAGAAA TTAAATTTTC 300 

ATCAGATGCC CGTTCAGCCA TGGTTCGTGG TGTCGATATC CTTGCAGACA CTGTTAAAGT 3 60 

AACCTTGGGA CCAAAAGGTC GCAATGTCGT TCTTGAAAAG TCATTCGGTT CACCCTTGAT 42 0 

TACCAATGAC GGTGTGACCA TTGCCAAAGA AATCGAATTG G AAGAC CAT T T TG AAAAT AT 480 

GGGTGCTAAG TTAGTATCAG AAGTAGCTTC TAAAACCAAT GATATCGCAG GTGACGGAAC 540 

TACGACTGCA ACAGTCTTGA CCCAAGCTAT CGTCCGTGAA GGAATCAAAA ACGTCACAGC 600 

AGGTGCAAAT CCAATCGGTA TTCGTCGTGG GATTGAAACA GCAGTTGCCG CAGCAGTTGA 66 0 

AGCTTTGAAA AACAACGCCA TCCCTGTTGC CAATAAAGAA GCTATCGCTC AAGTTGCAGC 72 0 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAAGTTGG 7 80 

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 9 00 

AATGGTGGCT G AC C T TG AAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 9 60 

CCAAGAAATC TTGCCACTTT TGG AAAG CAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 102 0 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 108 0 

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AG AT AT C G C C ATCTTAACAG GCGGAACAGT T AT C AC AG AA GACCTTGGTC TTGAGTTGAA 12 00 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 1260 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 132 0 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 13 80 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 144 0 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 162 0 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCGAG TGGGTTAACA TGATTGATCA 174 0 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA CAAAATGCAG CATCTGTAGC 18 00 

CAGCTTGATT T TG AC AAC AG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC 18 60 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 1920 

TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 

GTAGTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 2 040 

GTTCAAAGCG AT AAAAAT CC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2220 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 22 80 

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2 340 

AT AG CT C AAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 

AT AAAATCG C T TAT C ACT C A GTTTACGGCT ATCCTGTTGT ATGAGCTTCC AGTAGCGCTT 2460 
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GATAGCCTTG TAT T CAT GG G ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2 52 0 

ACTCATAGCA CGGCTAAGAT GTTGTACAAT GTGAAAGCGA TCCAACACGA TTTTAGCATT 2 580 

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 2 640 

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 27 00 

CGAACGGCTC TATCGTAGCG AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2760 

AGAACAGTGA TAATATTAAG AT TAT C AAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC G AG AAAAAT C ATGCTCAAAG 2 880 

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 2 940 

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3 060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 312 0 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGTGATAAA ATGTAATTGT 3300 

TCCATATGAA TCTTTCTAAT GAGTTGTTTT GTCGCTTTTC ATTATAGGTC ATATGGGACT 3 3 60 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 342 0 

TATAGAGCCG AAAATTCACA TCTAATATAT GC AG ACT ACT TTGAAATGAA ATTAAAAAAA 3 4 80 

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAAT C T AC AT TCAAATTTGT AGAAGGATAT 3540 

AAAAT AT AC C TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3 600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3 660 

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 372 0 

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 37 80 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3 840 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TT AT AT AT TT TACGAATTCT 3 90 0 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3 9 60 

TGTTTACACA ATTTATTTTA TAGT AC C AAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020 

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATGTTTAC ATAGGCTTCT 4080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 414 0 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4200 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 42 60 
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TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT TGCAATAGTA AGAAGTTCCA 4 320 

TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 43 8 0 

GGTTCTCGAG TGACCCAACG AC G AC CATC A CTTTCGGTAA CAAGTGTATG ATCGTTGACA 4440 

CCAATAATGG CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4 500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAGTCGC CTTCTTTTGG AAGCTTCATA 4560 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT ATTGT AC CAT AAAATT AC C C 4 62 0 

AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC TTGCTATATC 4 680 

CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTGAGTTAAA CGCTGCTTCA GTTCGTATCC 474 0 

TTCATACTTT CGGGCATACT TAGTATATTG CTTATCAAGT TCCTTGAAGA TGAGTTCCTG 4 800 

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GCAATTTTAG CATCAAAATA 4 860 

AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT TGCAGGGCAC GAGCTGGAAG 4 92 0 

TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4 9 80 

AAAATC AT T C AAGATTTCTT C T AT AGT AG A TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5040 

AGTCAGTACA TAAGGTCCCT TGTCTCCTGA AAGTTGATTG GCATTGATGA TAGCATAAGC 5100 

GTACTGGCTA TCATTAATCC ACTTCTCTTC TTTAAGATTA GCAATGACTT GAGAAACGAT 5160 

GTTTTCATTA ATATCATATT TTTTCAGATA TTCTCTGACC TCTTTTTCAG TACGTGCTTT 5220 

AAAGGATAAG TGGTAGAGGG C C AG AT T C TT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 52 80 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT AACATAAAAC GAACAATTGT 5340 

GTCTTCGGTG ATATAGCATT TGTCG 53 65 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 63 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 78: 

TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 12 0 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 240 
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GTCATTGTAG AAAAAATCAG GGTGAAGACA CCGAAGTTGC GGATAAAATA ACTAAAGTCA 3 00 

TCCGCATACC ATGTTTTTTT AAGTTTACTG AACATCTTTT AAAAGATACC CAACACTACG 3 60 

CAAAGTTTGC AAATTCTCTG CAAAAGTGGT TCCCTTTAAT TTCTTACGGA CTTTTGAAAC 42 0 

AT AG ACT T CG ACAACCGAAA TCGTTGTATC ACTATCAAAT CCCCATAGAC GGTCAAAAAT 480 

CTGCGTCTTA GGCAAAATCA CATTTTGATT TTGAAGGAAA TAAACTAGTA AATCGAACTC 54 0 

TTTCCCCAGC AATTCGACAG GAGTATCTTC AACTTTAACG GTATTGGTTG ATAAATTAAC 600 

CACGATATTC CCATAAGTCA AGGTGTTTTC ATTAAACTTC CCTGAACGTT TGAGAAGGGC 6 60 

CTGAATCCGC ATTTTAAGTT CTTCTAGGTA GAAAGGTTTG GTCAGATAAT CATCCGCTCC 72 0 

CAGTTCAAAT CCATGTCCCT TGTCATCCAA ACTTTCCTTG GCAGTCATAA TCAGAACTGG 780 

TGTCGTAATT CCCTTTTCAC GCAATTCTTT TAAGACTTGG AAACCATTTT TTTCTGGCAA 84 0 

CATCAAATCC AGCAAAATCA AGTCATAGAC ACCACTCTCA GCTTCGTAGA GACCTTCTTC 900 

TCCATCAAAT ACCTGCATAA CATCCGCAAA ATCGTCTAAA AAGTCAAATA CTGAATTTGA 9 60 

CAGACCTAGG TCATCCTCAA CCAATAAGAT TTTTATCATG AGAAACTCCT CCTTATTAAA 102 0 

AC T ATT AT AC CAAATTTGCC TTAAAAAAAA CTCAACTCTC TGCATTTTAC ATGAGATAGC 1080 

TGAGTTTTCT TTTTATTTTA GGCTTATTTA TGCATTTCCG TATTGAAGAA CAACTGCTTC 1140 

GACTGCAGCT TTTTCACGGC TAATCAAGTC AACACGCGCT GCAATTTCCT TGATTCCCAT 12 00 

ACCGATGTTA CGGCTAAGAG CAAGGTCAGA AAGTTGCGGT TCAAAGAACT CCTTGTATTC 12 60 

CGCCAAGCGT TGCTGAGTCT TAAATACATG AGCAGGAAGG ATAACAAAGC TATCAAAGCT 132 0 

CATATCTCCT CCAAGGGCTG CCTTAATCCA AGCCCAGTTT TCACGCGCCC AAGACCAAGC 13 80 

TGTTTTCTGA GTTGCTTGAT GAGCTAGGAA TTGGTAATAC CAAGCAGACA AGTCCTGTGG 1440 

TTTGACCACA AATTTGTCCT TCCAAGAAGT AATCAGGTTT TGGATATTAT CCGCATCTGT 1500 

ACTGTATGCA AGAGCTGCTG CCAACTGGCG TTTAAAGACA GCATCTGTTG CGTGAGTATA 15 60 

AGTATCAAGA TAAAGTGCTA ACAAGTCTTT AGTCTCATGA TGTTTCATCT CATTAATCAG 162 0 

AACTTGTGAG CGAATAGCTG CTGGGAGTCC TGCAAGATTC TCCTTGTGTG TTGCGAAGAT 1680 

TTGGCTAGCG ACTTGACTAG CTTCTGCATC ATTTGAGCGA AT CAT C AT CG AAAC AG C C AG 174 0 

CTGACGAACC AATTCATCCT CATCTGATTC TCCGTCTTTA GCTTCAAAAC CAAGACGGTC 1800 

ATAGTTATGA CGAGCCAATT TAGCAACCAG TCCTTTGAAG GCTGTTTCAG CATCCGTTCC 1860 

T T CAT C AAT A AAGCGCTCAA GGGCTGAAAT CACTTGAGAA ACAGCTGAAA CCACCAGATA 192 0 

AGACTCTTCC TTAGCAAGTT TATCAAGAAC TGGAAGCAAG TCTGCATAAG AAATGTGCCC 1980 

TGCCTCAGCC AACAAACGAC GTTCTTGAAC AATTTGCAGT TTGCTTGTGT TATCAAGTGT 2040 
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CTCTAGCTCA GCAAGAACAG CTGCTAACAA GTCTCCTTGA TAGTCGGTAA TATAGTGGGC 2100 

AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA G C AAGAAG AG CTGCGTAGCC 2160 

AGGGATTTCG AT ACT T T C AG TTTCGAGTGT ATCAGGCAAG CCTTTCCAGT TGCTATTGAG 2220 

GGGCACCACC CAGAGACGGT TCTTGTCTTC GTTCTCACCG ATGAAGAATT GTTTTTGTGA 2280 

AATCTTCAAG AC AT CAT T T T CAACTTTAAC AGTAAGAACT GGGTAACCAG GCTGTTCCAA 2 34 0 

CCAAGAATCC ATGAAGGCTG C G AC AT C AC G TCCTGACGCT TGACCAAGGG CATCCCAAAG 2400 

GT C ACT AC C A ATGGTGTTGC TGTATTGGTG TTTTTCAAAG TAGGCGTGCA AACCTTTAGC 2 4 60 

AAAATCAGCA TCTCCTAGCC AACGGCGAAG CATGTGCATG AGACGGCTTC CTTTGGCATA 252 0 

GACGATAGCG CCGTCAAAGA GTGTATTGAT TTCATCTGGA TGTTTAACTT CGACGTGGAC 2 580 

AGACTGAACG CC AT CAGT AG CGTCACGTTC AAGAGCAAGA GGTACTCCAC CTGTTTGGAA 2 64 0 

ATCTTCAAAG ATATTCCAGC TTGGTTCGAT GGTATCCACA CAGACGTATT C CATC AT ATT 2 700 

AGCGAAACTT TCATTGAGCC AAAGGTCATC CCACCATTTC ATAGTCACGA GGTTCCCAAA 27 60 

CCATTGGTGA GCCAATTCAT GGGCCACAAC AAGGGCAACT TGTTGACGGC TAGCAAATGT 2 82 0 

AGAGTTCTCA TCGACAACCA AGTAAACTTC ACGGTAGGTC ACAAGACCCC AGTTTTCCAT 2 88 0 

AGCACCAGCT G AG AAGT C AG GAAGGGCGAT GTGGAGAGAT TGAGGAATTG GGTACTTAAC 2 940 

TCCATAGTAA TCTTCGTAAA ACTCGATAGA GCGAACAGCG ATATCCAGTG AGAAATCAAG 3 000 

ATTTGAAAGT GGATGTGCTT TGGTTGAGTA GACACCTACC AGGGTACCAT TTTTAGTTTT 3 060 

AGCGGTCACC CCTTGCAAAT CACCAGCAAC AAAGGCCAAC AAGT AAGAAG ACATGCGAGG 312 0 

TGTTGTCTCA AACTTCCAGA TACCTGTTTC CTTACGGTTT TCAACATCGA TTTCTGGCAT 3180 

GTTTGACAAG GCCAATTCAC CTTCTGCTTG GTCAAAGCGA AGAGAGAGGT CAAAAGTTGC 32 40 

TTTGGCTTCA GGCTCATCCA CACATGGGAA AGcTTCGCGC GCAAAATGGC TCTCGAACTG 3 3 00 

AGTAGACAAG ACCTCCTTCT TGACTCCATC AACTGTATAA T AAGAAG GG T AAATCCCTGT 3 3 60 

CATGTTGTCT GTAATTTTAC CAGAAAAGGC AAGAACCAAT TCAACTTGAC CAGCCTCAGC 342 0 

CAATTCGATA TGAAGGGCTT CATTGTCATG GTCAACTGTA AATGGACGAG CTTGACCTGC 3480 

AACTTCTACA GAGGTGATTT CCAAATCTTT TTGGTGGAGG GAGATGCGGT CACTCTGTGC 3 54 0 

TTGACCAGTG ATGGTCACTT TCCCAGAAAA AGTCTTGGTC TCACGACTCA AATCTAAAAA 3 600 

TAAATCATAA TGTTCAGGAA CAAATTGCTT AATGGG 3 63 6 

(2) INFORMATION FOR SEQ ID NO : 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5066 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 79: 

ATAGCGTGTA ATAATCGATT TTAGAGGTAC CATAAGCCAC CTCCTACAAA TAGAAACCGA 6 0 

TATAAATCAA TGCCTTCCAC CCTTAGACTT CCCTAGTTCC TGTCTCAAGC GAAACATTTC 12 0 

TTTGAAACAG GAATAAGTTA ACCAATTCAT ACCAATAGCT AGCAGAATAA AAAGAAACCA 18 0 

AATGCCCCAT AACTTGATAT C TGTCAC AT T TCTCAAGACG GTATTGAAAA ACAGAACTGA 24 0 

AACAACTGTC CAAGCAAGGC TAAAAAGAGA AT AG AAG GGG ATGTAAAACC AGTAAAAATA 300 

ATAAAAAATT GGAAAAAACT TACTATTTCT GTTGGCCTTT TCAATCCAGT TATCAAAATA 3 60 

AAAGTACGGT GCTAAAAGTA AGAATTTAAA CAAATGTTCC ATCACCGACA TCCCCCCTTC 42 0 

TTTTGATAGC GTTTTCTATT ATTTTATTAT ATCAAAAAAA TCCGGAACTG T CAT T C C AG A 4 80 

TTCTACTTTT TTATTTGCGT TTTCTTGCGA TGAGATGAAT CGGTGTTCCC TCAAAAACAA 54 0 

AGGCCTTGCG GATTTGATTT TCCAAGAAAC GCAGGTAAGA AAAGTGCATG AGTTCTTCTT 600 

CATTGACAAA GATGACAAAG GTTGGTGGTT TGGTTGCCAC TTGGGTCGCA TAG AAA AT CT 660 

TGAGACGTTT TCCTTTGTCT GTCGGTGTTG GGTTGATGGC AATGGCATCC AT GAT G AC AT 72 0 

CGTTCAAGAC AGCTGATGGA ATACGTGTAT TTTGACTTTC GCTGATTTGC TTAATCATCT 7 80 

CAGGAAGTTT GTGGAGACGT TGCTTGGTTA AAGCTGATAC AAAGATAATC GGTGCGTAAG 84 0 

GCAGGTATTG GAACTGCTCA CGGATATCTT CTTCCCAGTT TTTCATAGTG TGGTTATCTT 900 

TTTCAAGCGT ATCCCACTTG TTGACCACGA TAATCATCCC TTTACCAGCT TCATGGGCAA 960 

ATCCTGCGAT ACGCTTGTCG TACTCACGAA TGCCTTCTTC CGCATTGATG ACCATCAAGA 1020 

CCACATCTGA ACGGTCAATA GCACGCATGG CACGCATAAC AGAGTATTTC TCAGTATTTT 1080 

CATAAACCTT ACCAGACTTA CGCATACCAG CCGTATCAAT CATGGTAAAC TCTTGACCAT 114 0 

CTGTATCTGT AAAGTGGGTA TCAATGGCAT CACGAGTTGT TCCAGCAACA GGACTAGCAA 12 0 0 

TAACACGGTC TTCTCCCAAG ATAGCATTGA TCAAGCTTGA TTTTCCAACG TTAGGACGAC 12 6 0 

CAATCAAGCT AAACTTAATG AC AT CTGG AT TTTCTTCCTC AT ATT CAT T T GGAAGATTTT 132 0 

CTACGATCGC ATCTAGCACA TCCCCTGTAC CGATTCCATG GACAGATGAG ATAGGCAATG 13 8 0 

GTTCACCCAA ACCGAGAGCA TAGAAATCAT ATATATCATT TCTCATCTCA GGGTTGTCCA 1440 

CCTTGTTGAC TGCGAGGATA ACTGGTTTGT GGGTCTTATA AAGCTTACGA GCTACGTATT 1500 

CGTCTGCATC AGTAATTCCT TCCTTACCAG ACACGACAAA AACGATAACA TCTGCTTCTT 1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG AC AT CAT C AA 162 0 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC G ATT GAG C C A CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TG ACT C C AG C TTGTGGTCGC ACGCGCCCCA 19 2 0 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2 040 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2220 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGGCTCCCTT TTCTGCTCGG 2 2 80 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 234 0 

AAT T CAT AC A AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 24 00 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 24 60 

GGAATTTCTT GCTCTTTAAA CCTGCGCACT G AAGT AT GAG CCTTCATCAA TTTAATGGTT 2 52 0 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 264 0 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 27 00 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACGGTTGAT TGCTCGGCGC 27 60 

C AAT AT CAT C TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2 82 0 

TGC CATC ACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2 940 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT AGCTGGATAT TGCTCCACGA 3 000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 312 0 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT G C AT AACC AT 3180 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 3240 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3300 
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GAGTTAAGGA 


TTCTTGCTGG 


ATAAAGGCCG 


CAACATCAGG 


GTCCTTCATG 


ATTTTCTGGA 


3360 


CCAAATCTTG 


ATAATAAAAA 


CGGCTGGGTT 


GACGTTTGAG 


TACGTCTCCG 


ACACTTTCCA 


3420 


TCTAATCTCC 


TCCTTTTTCT 


AATCGAGCTA 


ATAGTTCTTG 


CTTCTTACGT 


TCTAGTTCCA 


3480 


GACGAGTTTC 


CTCGCTGGTT 


TCATTCTTAT 


ATTCAGGATT 


ACTCCATTTA 


GGAACATTGG 


3540 


TTTTTTCTGG 


GGCAGTCTGA 


TTCTGTTTTT 


GTGTTTTTGC 


TTTCTGCCCT 


CGATCACGAA 


3600 


TTCGTAAAAC 


GGCCTCTTCT 


GCCGAATGAA 


TCTTTTGATA 


GGCATAGTCA 


TTGGCTACCT 


3660 


TCATGGCATA 


TTTCTCATTG 


ATATTTGCCG 


AATCCACCTT 


ATT AAAGGT C 


AATAAGAGAA 


3720 


TAATATTGAT 


GACTTCGTCC 


AGTAAGCCCA 


AGCCAGCCAT 


CTGTTGCAAG 


AGTTCTCTTT 


3780 


CTGTTTGGGT 


AATGGTTCCC 


TTGCGTGTTT 


GCTTGATTTC 


TGCTAAGAAC 


TGCAGGGCAG 


3840 


TTTTACTTTT 


AGCTTCTTTG 


ATAATGGTCG 


CTTCCTTAAG 


ACTAAAGTCA 


GAGGAAACTG 


3900 


GTTTTTGAGC 


AATTTTTTCA 


CGCATGCGTT 


TGGTTGAAAT 


AACCTGGGAA 


ACAGCTGTTG 


3960 


ACTTGGCCAA 


TTGATAGGTT 


TCAAACCAAG 


TCCATTTCTT 


CTCCTCGGCA 


ATAGCAAAGA 


4020 


GGTTTAAGAC 


ATCGGACTGC 


TCATCCGCAA 


AACGAAGTCC 


ATCTCGAGCC 


ATCAGCTGGC 


4080 


GAAAATGTTC 


CAAGTCAAAA 


TCATTGGCCA 


CTTTCTTCTT 


GAGACCAAGG 


TCTTCTTGAC 


4140 


TGCCTAGTTC 


TGCCAATTCT 


GGAAAGACTT 


GATTGAGTGA 


G AC AGGT AT T 


TCTTCACCAT 


4200 


CAGCACTTTC 


AAC TTTC AAA 


TCCTCCACAG 


CTACATCGCC 


AATCTTTTTC 


TCTAAGAGTC 


4260 


TGCGATAAAC 


AGGATGCCCC 


AAGAAGTCTT 


G AC TAG AT AG 


AGGAGCATGG 


AGGGCTAGCT 


4320 


GATAAACATC 


ACCCTTTTGA 


TAGAGGGTCA 


AG AG AT T AAA 


AGCAGATAAG 


ATTTTCAATG 


4380 


ATTTTATCAG 


TCTATCCATC 


CCAAAGTTGA 


GATGGTTGAG 


AATGCTTGAA 


AAAAGATATT 


4440 


CCTTTCTACC 


ATT AT C C C AA 


AAACTGATTG 


TATAAAGATA 


AAGGCTCAGT 


GCCTCCTGAC 


4500 


CGATAATCGG 


GAGGTAGCAC 


TGTACCAGAG 


ATGAGGTATC 


T TGCG AC AC C 


CGATTATTCT 


4560 


TTAGATAAGA 


AAAACGGTCA 


ATTGGCTTCA 


TTTATCTTTC 


CTTTTTCTTT 


TTAGAGGACT 


4620 


GGGTGATTTG 


TTGGAGCAAG 


CTCTCTAACT 


C ACTG AC AT C 


CTTAAAACTA 


CGATAGACAC 


4680 


TAG C AAAACG 


TACATAGGTA 


ATCTCGTCCA 


ATTCAGCCAA 


CTCCTCCATG 


ACGAGTGAAC 


4740 


CAATGTCCTC 


ACTTTGAATT 


TCATTTTCAT 


TTCGACCACG 


GAGTTTCTGT 


TCGATACGAT 


4800 


TGACTACCAT 


GTTGATTTCA 


TCACTTGACA 


CAGGACGTTT 


CTGGGCTGAG 


CGGATAATCC 


4860 


CATTAAAGAT 


TTTATCTCTG 


GAGAATTGTT 


CCCGTGTGCC 


ATCTTTTTTA 


ACAACCACTA 


4920 


AGGTTCTTTC 


TTCTACTCGT 


TCGTAGGTTG 


TAAAACGGTG 


TTGGCATTCG 


TCGCACTCAC 


4980 


GTCTTCTACG 


AATGGTGTTC 


CCTTCTTCTG 


CTTGGCGACT 


AT C GAT AAC A 


CTTGACTTGG 


5040 


TAGCCCCACA 


TTTTGGACAG 


GGTACC 








5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 607 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

CACTTGAAGT ATTTGAAACA GCTATGGAAA ACATCATGCC TGTACTTGAA GTACGTGCAC 60 

GTCGTGTTGG TGGTTCTAAC TACCAAGTCC CAGTTGAAGT TCGTCCAGAA CGTCGTACAA 12 0 

C ACTT GG ACT TCGTTGGTTG GTAACAATCG CTCGTCTTCG TGGTGAACAC ACAATGCAAG 18 0 

ACCGTCTTGC AAAAGAAATC TTGGATGCTG CTAACAACAC TGGTGCAGCA GTTAAGAAAC 2 40 

GTGAAGATAC TCACCGTATG GCTGAAGCTA ACCCTGCATT CGCACACTTC CGTTGGTAAG 3 00 

ATAGGATGCG AAAGCGTTAA GAAAGTCCCA GAGAAAATAG GGAATCGAAG CAGGTTGCGG 3 60 

TTGCAACCAA TGAGATTCAT CTTTTTCTCC AGACTTTTAG CTTGAGCTCA ACTAAATCAT 420 

GATGCTAGGA ACGGTAAGGA TGCAAGGTAA AAATAGGAAA CTGACGCAGT ATTCGACGAA 480 

TACAAGGAGT TTTATCTTTT TCACGCAGCA TCCCGTTCCA GCTCACATCG GCTAACTAAC 540 

TTTAGCCCGG GTTCAAATTA GCTAAATCGA TTAGTATTAG CTATAACTCA GCTTACCATC 600 

TCGTAAGTTG AAACCAACAA TAGCATGAAA ACATTGAGAA CGGGTAGGTC CTGCCTATCC 660 

GTTTTTATTA AAATCGTGTT ATAATAGAAT AGAAATCAAA AATAAATAGG AGAAACAAAC 72 0 

CTCATGGCAC GCGAATTTTC ACTTGAAAAA ACTCGTAATA T CGGT AT C AT GGCTCACGTC 7 80 

GATGCCGGTA AAACAACAAC TACTGAGCGT ATTCTTTACT ACACTGGTAA AAT C C AC AAA 84 0 

ATCGGTGAAA CTCACGAAGG TGCGTCACAA ATGGACTGGA TGGAGCAAGA GCAAGAACGT 900 

GGT AT C AC G A TCACATCTGC TGCGACGACA GCTCAATGGA ACAACCACCG CGTAAACATC 9 60 

AT CG AC AC AC CAGGACACGT GGACTTCACA ATCGAAGTAC AACGTTCTCT TCGTGTATTG 102 0 

GATGGTGCGG TTACCGTTCT TGACTCACAA TCAGGTGTTG AGCCTCAAAC TGAAACAGTT 1080 

TGGCGTCAAG CAACTGAGTA CGGAGTTCCA CGTATCGTAT TTGCCAACAA AATGGACAAA 1140 

ATCGGTGCTG ACTTCCTTTA CTCTGTAAGC AC ACT T C ACG ATCGTCTTCA AGCAAATGCA 12 00 

CACCCAATCC AATTGCCAAT CGGTTCTGAA GATGACTTCC GTGGT AT CAT TGACTTGATC 12 60 

AAGATGAAAG CTGAAATCTA TACTAACGAC CTTGGTACGG ATATCCTTGA AGAAGACATC 132 0 

CCAGCTGAAT ACCTTGACCA AGCTCAAGAA T AC C GTG AAA AAT TG AT T G A AGCAGTTGCT 1380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 144 0 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAGT ATTGTGTGGT 15 00 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 15 60 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 162 0 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 16 8 0 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 17 40 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 18 60 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 192 0 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 19 80 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 2 04 0 

AC AAACGT TG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TGAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 22 2 0 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2 2 80 

GGATTCGAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2 3 40 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2 4 00 

GACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTGCC 24 60 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA AC C AG CT AT C 2 52 0 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTGG TGATGTTATG 2 580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2 64 0 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACAGT TCTTCGTTCT 2 7 00 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2 7 60 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2 940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 312 0 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3 2 40 

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3 3 00 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3 3 60 

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 342 0 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3 540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3 600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3 660 

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGGACTATAC 3 720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3780 

CAT C AAG AAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA AC T T C AC AC G 3840 

CGATTTGACT AT G AACGT AC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3 90 0 

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GAGGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG G AC C ATGGG A ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTGTC AGAAGCAACC TACGTGGTCT TTGACGTGGA 42 00 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 42 60 

CAAGGGGAAT GTTATTGCTG AATTTGATGA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 444 0 

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATGATCTTC CAAAGATTAG 4 500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT AT AAACG C C A 4 560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4 620 

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4 680 

AC ATGGT GTG ACCGATTTAG CTAGACTCAA CATTGATCTA AT C AGTC C AG ATTCTTACAA 4740 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4 800 

CTTTAAGCTG GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4 8 60 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 492 0 
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AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4 980 

TGATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5 040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG AT AG AGGT TG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 52 80 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGTTA TTGAAAACAC 53 4 0 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGT G ACT TGTATACGCC 54 00 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 54 60 

TTATGGAAAT CCGCTGCCAG AT ATTGT T G A TTTGCGGATT GAAAAAGAAT T AAC AT C CAT 552 0 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATGCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 564 0 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 57 00 

CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 57 60 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 582 0 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AG CG AG ATT A 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 612 0 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 62 40 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 63 00 

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 63 60 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 642 0 

TGGAATGGTA GACGAAACCC AT C CG AC AAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 6480 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GG AC CT AT CG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT T T AC CAT TAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 67 80 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTTATTACTA 6840 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 69 00 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 696 0 

TAATGTGGAA AT CG AT CTCT ATACAACTCT TG AG AT T GT C AATGAGATGT GGGAACGAGG 7 02 0 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7 080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 72 00 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 72 60 

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7 32 0 

TAAAAGAGGC TAACGTATAT CCAATAGATT T AC ATT AG C T TTCTTTTTTG TTAAAATAGT 73 8 0 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GGATAGTGAG 74 4 0 

GTTAAGGAGC AGGCCCAACA GGTGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTGATGT TAGACTAGAC 7 5 60 

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GAT T TAG AC C AAAATCGTAA TAT GAG C C AG 7 62 0 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GT T AT C AT AA 7 6 80 

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAAGCGTGGT TTGAAGGCAG AATTATTAGA 774 0 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AG AAC AT C C T GCCAGAAATC GT G AT C ATT C 7 8 00 

AT TG ACGGC A TCCAAGCATT TTCAAGGAGT TCGTGAATGC CATACCCAGC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7 92 0 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAAGAAT ATTTGCGGAA 7 980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 804 0 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG T C AC AC T T TG 822 0 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 82 80 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 8340 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 8400 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 84 60 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 852 0 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 864 0 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGC AG AT C C TGTTAAGGCT 87 00 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 87 60 

TCAGCCCTTC ACT AC AC AG C GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 882 0 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9 000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 912 0 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AG ATG AG CG A AGAGGAGCTT 924 0 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT C ACGT AG AC T TTATGATTGG TTCTAACCAA 93 00 

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCG TAATGGGAAT 9 3 60 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 942 0 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 94 8 0 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 9 54 0 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9 607 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AG AGCG AG CT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 12 0 

AAATAGTATA C C AC AAT AC A TTTAGATCAT CAAGGTCACT TAATTCTTGA AAT AT C AG AT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAAC TTGAT 240 
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ACTATGCGTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 3 00 

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT GGTGAATATT TGTAAATCAG 3 60 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 42 0 

GTATATATGT GACTGACTTC ATCAGTTCTA TCTACAACCT CAAAACAGTG TTTCGAGCTG 4 80 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 54 0 

TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTGCGTCTA ATAATTTTTG 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAG AGGGGATTAA TTTTAAAAAA 6 60 

TCCAGTCTTG T AAAATT T AG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 72 0 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GGTCATCAAT 7 80 

ACCTCTTGCT GAACAAAATG CTAAATCGAT ATTAAAATGA TCTAATAAAG AATTTTCCTT 84 0 

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTGACCTCG CC AG AT GTG A TAAAGATTTT 9 00 

GGAGCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 9 60 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACAGTAG TACTAGAATC 102 0 

TAGATAGATA CACATACCAG AC C G AAT AAA GTCTTTAGCG AGACTAGCGA TTAGTCTTTT 108 0 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGAGGA 114 0 

CAGGGTCACG TATCCGTGCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 12 00 

ACGACGTAAG GTACTTGTGC TGGAGAAAGT GATTTCTGCC AGCTCTTTTA CGGCAATTCT 12 60 

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 13 2 0 

CTCCTAATTT ATCATTTCAA C TAT ATT AT A GCACAAATTG GAGGAATTTG AATTATTTTT 13 80 

ATGAATATTG GGTTAACATT TGAACATTAT TCAAGTAAGC GTTCACATAT TGAAAAAATA 144 0 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGAGCAATC GTTGGTTACC TTTCTGAAAA 15 60 

TAAACTAGTA ATGGAAGAAA TAAATCGCTT TTCTAATCTA CCTATTAGAG TAAAAGGGCA 162 0 

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC GCTTGGCTAA 1680 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GGAGTTGATT TTGGACTGAT 17 4 0 

TGATAATGAA GGTAAGCTGT TATTACAACC TGTT C ATT AT CGT G ATG AAA GAACAAAGGG 1800 

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAG AGACAGGAAA 1860 

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 192 0 

CTCTTTCTAT AAG AC C AAT A AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 1980 



WO 98/18931 



PCT/US97/19588 



644 



AGGTAAGTTT 


GCTACAGAAA 


AAAGCATTGC 


TTCAACAACT 


CAATTATTTG 


ATCCTAGGAG 


2040 


TCAAAATTGG 


AATCAGAATA 


TCTTAAAACT 


ATTTGAATTG 


GATTCATCTT 


TACTTCCTGA 


2100 


AATTGTTTCA 


GAGGGAAATG 


TTCTTGGAAG 


GATAAAAGAG 


GAGTATGGTT 


TAGGCGATAT 


2160 


TCCTGTTGTG 


AATGTTTGTA 


GTCATGATAC 


AGCAAGCGCG 


ATTGTCTCAG 


TACCTAAGAC 


2220 


AGAAGGTAGT 


TTATTTATTT 


CATCAGGTAC 


TTGGTCTTTG 


GTTGGAGTGG 


AAC TT AC TT C 


2280 


ACCGATTCTT 


ACTACCGAAT 


CCTTCAGTTA 


TGGATTTACA 


AATGAAGTCG 


GTAAAGATGG 


2340 


AGTGATTACA 


TTTCTGAAGA 


ATTGTACAGG 


GTTGTGGATC 


ATAGAGGAAC 


TAAGACGTTC 


2400 


ATTTGAACGA 


AGAGGGAAAG 


CCTATTCTTT 


TGATGATATT 


AGGACAATGG 


TGGAGAAAGA 


2460 


AAAAGAAAAT 


CTTCCTCTGA 


TTGATACTGA 


ATCAACTGAA 


TTTGCAACAG 


AATCTGATAT 


2520 


GCACAAGACT 


TTGACAGAAT 


ATCTAGCTTA 


T CATC AT G AA 


AC T AG AG AGT 


GGACAGATGG 


2580 


AC AAC T ATT T 


AAGATTGTTT 


ATGAAAGCCT 


AGCTGAAACG 


TATAGGAAAG 


C G AT AGAGTT 


2640 


ACTAGAAGAA 


CT AAC TC AT A 


AGGTTTATAA 


GAGGATATAT 


GTGATTGGAG 


GAGGTGCTAG 


2700 


AGCCAGTTAC 


TTTAACCAAA 


TGATTGCTGA 


TAGAACTGGT 


AAAGAGGTTC 


TTACAGGTTT 


2760 


GACTGAGGCT 


ACAGCTGTGG 


GGAATATTGT 


TGTGCAGCTC 


ATAGCTATGG 


GACAATTAAA 


2820 


AGGGATGGAA 


GAGGCTCACC 


ATGTTATTGA 


GGAGTTTCTA 


CAATTAGAGA 


GTTATTACTC 


2880 


CCAAAAGAAT 


TAAAAAGATT 


GAGAGTTTGT 


AAATTTGCCT 


CCCTCCCCCT 


TCTTAGCTTT 


2940 


TGTGCAGGAA 


GGGGGGATAA 


TTGGTGAATT 


G AAAAAT AT T 


TAGTGTTTTG 


ATATGAGGAG 


3000 


GACAAGGATG 


TCAGATGTAA 


AACAAGAATT 


AATTAAATAT 


GGTAAGAAGC 


TAGTAGAAAC 


3060 


AGATTTGACG 


AAAGGAACAG 


GTGGGAATCT 


CAGCGTTTTC 


GATCGTGAAA 


AACAATTGAT 


3120 


GGCAATTACC 


CCGTCGGGTA 


TTGATTTCTT 


TGAAATCAAA 


GAATCCGATA 


TTGTAGTGAT 


3180 


GGATATTAAT 


GGAAATGTTG 


TAGAGGGAGA 


ACGCTTGCCA 


TCTAGCGAAT 


GGTATATGCA 


3240 


TTTGATTCAA 


TATCAAACTC 


GTGATGATAT 


CGATGCAATT 


ATCCATGCTC 


ATACAACTTA 


3300 


TGCAACAGTA 


TTAGCTTGTC 


TCAGAGAACC 


ACTTCCAGCG 


AGT C ATT AT A 


TGATTGCAGT 


3360 


GGCAGGGAAA 


GATGTTCGGG 


TAGCTGAGTA 


TGCAACATAT 


GGCACGAAAG 


AATTGGCTGT 


3420 


GAATGCAGCT 


AAAGCAATGG 


AAGGTCGTAG 


AGCAGTTTTA 


CTAGCGAATC 


ATGGAATTTT 


3480 


AGCAGGTGCA 


CAAAATTTAT 


TGAATGCATT 


TAATATTGTT 


GAAGAAGTTG 


AATATTGTGC 


3540 


AAAAATTTAT 


TGTTTAGCTA 


AGAATTTTGG 


AGAGCCAGTA 


GTTCTTCCTG 


ATGAGGAGAT 


3600 


GGAATTGATG 


GCAGAAAAAT 


TTAAAACATA 


CGGTCAGAGA 


AAATAGGGAG 


GATATTAATG 


3660 


TTAAAACATA 


TACCGAAAAA 


TATTTCTCCA 


GATTTATTGA 


AGACTTTAAT 


GGAAATGGGA 


3720 


CATGGAGATG 


AAATAGTATT 


AGCTGACGCG 


AATTATCCTT 


CTGCCTCATG 


TGCAAATAAG 


3780 
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CTAATTCGTT GTGATGGTGT AAATATTCCA GAATTATTAG ATTCCATTCT GTATTTAATG 3 840 

C C ATT AG AT A GTTACGTCGA TAGTTCAATT CAGTTTATGA ACGTTGTTTC GGGTGATGAT 3 900 

ATTCCTAAGA TATGGGGTAC CTATAGACAG ATGATTGAAG GTCATGGTAC AGATCTTAAA 3 960 

ACGATTACTT ATCTTAGAAG AGAAGACTTT TATGAACGTA GTAAGAAAGC TTATGCTATT 4 02 0 

GTTGCTACAG GAGAAACTTC ACTTTATGCT AATATTATCC TTAAGAAAGG AGTAGTTGTT 4 080 

GAAAGAGAAA ATGTTCAATA GAGGAATTTT AGTTGCCAGT CATGGTAATT TTGCTAGCGG 414 0 

AGCTCTCATG ACCGCAGAAA TGTTTGTTGG TGAGACAACA AATGATAGAG TTAGGACATT 4200 

AGGTTTGATG CCTGGAGAGA ATATTGTAGA GTTTG AG CAT TATTTTAAAA ATCAAGTGGA 42 60 

TGAACTGTTA GACTCAAATC AAGAGGTTAT CGTTTTGACT GACTTGATTG GAGGAAGTCC 43 2 0 

TAATAATGTG GCTTTGTCAC GGTTTTTAAA TTTGGATTCA GTTGATATTG TAACAGGGTT 43 80 

TAATATCCCT CTCCTAGTGG AATTAATATC AAGTTATGAT TCAAAAATCA ATTTAGAAGA 4440 

AATTGTTCAC AATGCTCAAA ATAGTTTGTT TAATGTTAAA CAACAACTTA ACGTAGAGGA 4500 

GGAAGAAGAT TTATGTCTAT AGAGTTTGTT CGTATTGATG ACCGTCTGGT ACATGGTCAA 4 5 60 

GTTGTCACTA CGTGGCTAAA AAAGTATGAT ATTGAGCAAG TTATCATTGT TAATGATCGC 4 62 0 

ATCTCAGAAG ATAAAACACG ACAATCTATT TTAAAGATTT CTGCACCGGT AGGTTTAAAA 468 0 

ATTGTTTTCT TTAGTGTAAA ACGGTTTGTG GAAGTTTTAA ACTCTGTGCC AATAAAAAAG 4 74 0 

AGAACAATGC TGATATATAC AAATCCAAAA GATGTGTATG ATT CT AT T G A AGGAAATTTA 4800 

AAATTGGAGT ACCTCAATGT AGGACAGATG AGTAAAACGG AGGAAAATGA AAAGGTAACG 4860 

GGAGGTGTAG CTCTAGGTGA AGAAGACAAA TATTATTTTA AGAAAATAGT T G AT AAGGG A 4 92 0 

ACGAGAGTTG AAATTCAAAT GGTTCCTAAT GATAAAGTTA CAATGTTGGA AAAATTTTTA 4 9 80 

TAAAAATAAT TTAAGGAGGT ACAGTATATG CTATTCACAC AAGC AT T AC T GGTGACATTA 5 04 0 

GTTGGGATTA TTGCCACTAT TGACTATAAT GGACCGTTAT TTATGATTCA CCGTCCGTTA 5100 

GTTACAAGTG CAATGGTTGG CTTAGTATTA GGAGATTTCA CCCAAGGTGT TCTTATTGGT 5160 

TCAGCTCTTG AATTAACTTG GCTCGGTGTA ACAGGTATTG G AGGT T AT AC TCCACCAGAT 5220 

ACT ATT TC AG GTGCGATTAT TGGTACTGCA TTTGGTATTT TATCTGGTCA AGGAGAAACT 5280 

GCTGGTATCG CTATAGCAGT TCCAATTGCA GTTGCTACCC AACAGTTGGA TGTTCTTGCA 5340 

AAAACTTTAG ATGTTTATTT TGTGAAAAAA GCTGATAATG ATGCTAAAAA CGGAGATTAT 5400 

TCAAAGATCG GTTTTTATCA TTATTCAAGT TTGGTTTTAA TCACGTTATT TAAAATTGTA 5460 

CCAATTTTCC TAGCTATTAT GCTTGGAGGG GAATATGTGG CAGACTTGTT TGCTAAGGTT 552 0 
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CCACCAATCG TTATGCAGGG ACTTAACTCT GCAGGTGCTT TACTACCTTC AATTGGTTTT 5580 

GGTATGCTTT TAAATATGAT GCTCAAGAAA AATATGTGGG TATTCTTGTT GATTGGATTC 5640 

ATTTGTTCTG TGTATGGAGG AATGTCAACC ATTGGGATCT CACTAGTTGG TATTGCGGTA 5700 

GCATACTTCT AC G AT ATG AT TGGAAGCAAA C C AC AAG AAA CAACTTCAAG TAGTGATGTT 57 60 

GAGGAGGATC TTGATCTATG ATGAATAATA AAGTAACTAA AGTTGAACTT AAAAAAGTTT 5 82 0 

TCAAACGAAG TTTTATGTAT GGTTCTTCAT GGAACTATGA GAGAATGCAG AACCTAGGTT 5880 

T T C TAT AT AC AATTCTTCCA GTATTGAAAA AAC T AT AC C C AGACAAAGAT TCAGCTTCTC 5 940 

CTGCAATGAA ACGTCACCTT GAGTTTTTCA ATACTCATCA AACAGCGGCA C CAT TT ATT C 6000 

TTGGAGTTAC TTCCGCTATG GAAGAACAAG AAGGAAATGA AGGTGCAGCT TCAATTACTG 60 60 

GTATTAAAGT TGGCTTGATG GGGCCACTGG CTGGTCTAGG AGATAGTTTG TTCTGGCTGA 612 0 

CACTAGTTCC TATCTGTTTT AGTATTGGTG CGTCTTATTC TAAAGACGGC GGTGCTTTAG 6180 

GTATCTTTAT CGCCTTAATA TTGTTTAATA TTATTAATAT TCCTGTTAAA TATTTCGGTT 62 4 0 

TGAAATATGG GTATACTAAG GGTTCTAGTC TTATCCAAGA AAATAATACA AAAGGAACAT 6300 

TGAATCGCGT TACGAGTATG GCGACAGCAT TAGGGCTAGT ACTAGTGGGT GGTTTGATTC 63 60 

CAT C AATGGT TGGTATTAAT TTTGGATTAG AATTTAAGCA GGGGGAACTT GTTATTTCTG 642 0 

TTCAAGAAAT GAT T AC AAAA TTAATTCCAG GATTTATCCC TATGGCTTTG ACTTTATTAA 64 80 

TGTGTAAATT AAT TAG AAAA GGAAAGAATC CGGTTGTACT AATCTTTAGT GTTATGGCTA 6 54 0 

TTGGAGTTAT TCTAGTTGTT TTAGGAATTT TGAAGTAGTA GAAAGTGTGG AGGTGGTATT 6600 

TGGGATATCA CCTCCATTTT GGAAGAGAGG TAAAGAGTGA AATTATGGTA TAAGAAAGCT 6660 

GCCGCAAATT GGAATGAAGC CTTGCCGATT GGGAACGGTC ATTTAGGTGG TATGATTTAT 672 0 

GGTTCAGCTA CAAAAGAATG TATTCAACTA AACGATGAGA CTATTTGGTA TAGAGGAAAG 6780 

T C AG AT AGAA ATAATCCAGA CTCACTATTG CATCTTAAAA AAATTCGGGA ATATCTTTTA 6840 

GATGGAGAAA TTCAGAAAGC CGAAGAATTG ATAAAGTTAA CAGTGTTTGC TACCCCAAGA 69 00 

GATCAAAGCC ACTATGAATT ACTTGGGGAA CTTTACATTG AGCATATAGA TATTCAGTCT 6960 

TGTGCTCTTT CATTGTATGA AAGAGAGCTA GATTTAGATA CAGCTATTTC TAATGTTGTG 7 02 0 

TTTGAGCCTA ATAGTTGTAA TTTACAAATA AAAAGAGAAT ATTTTACGAG TTTTAATAAG 7080 

AATATTTTAT GTTGCCGTAT AGTGTCATCA GTTCAAAACA CATTAAATTT AAACATTAAT 714 0 

TTGGGTAGAA ATAAACGGTT TAATGACGAA GTATCTAAAC TGGATTCAAG TACAATTTTA 7200 

ATGTCGGCCT CTGCTGGAGG TAGAAAAGGT GTTCAGTTTA AAGTAGTATG TCATTCTAAG 7260 

GTTACGGATG GTGAAGTAAG TGTATTGGGA GAGACAATAG TTATTCGGAA TGCTACAGAG 73 2 0 
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GTATTTCTTT 


ATCTCAAATC 


AATGACGGAT 


TATTGGGGAA 


ATATAGATAT 


TTCTTCTCTT 


7380 


CAGGGAGAAT 


TTAGTAGTAT 


TGATTACTTT 


ACAGAAAAAG 


ATGAACATGT 


AAAAAAATAT 


7440 


CAGGAGCAAT 


TTAATAGAGT 


TGATTTTAAA 


CTAGACTATA 


GTAAAGGTTG 


TCTT AG CAT T 


7500 


CCAACGAATC 


TACTTCTTGA 


AAACACTAAA 


AAGTATAGTA 


ACT ACT TG AC 


TAACTTGTTA 


7560 


TTTCATTATG 


GAAGATATCT 


GTTAATATCG 


TCTAGTCAAC 


CGAATGGTTT 


AC CTGCC AAT 


7620 


CTTCAAGGAA 


TATGGTGTGA 


TGAATTAAAT 


CCAATTTGGG 


GTTCTAAATA 


T AC GAT T AAT 


7680 


ATTAATACTC 


AAATGAATTA 


TTGGATGGTA 


GGTCCATGTG 


ATTTACCAGA 


AGTAGAATAT 


7740 


CCATTATTTG 


ATATGCTCGA 


AAGAATGAGA 


GAACCGGGAA 


GACTAACCGC 


TAAGAAAATG 


7800 


TATGGAGCTA 


GAGGTTTTAC 


AG C AC AT CAT 


AATACGGATG 


GTTTTGGCGA 


TACGGCTCCC 


7860 


CAATCTCATG 


CCATGGGGGC 


TGCAATTTGG 


GTATTAACTA 


TTCCATGGTT 


ATGTACTCAT 


7920 


ATTTGGGAAC 


ACTATTTATA 


TTTCCAAGAT 


GAGCGTATTC 


TTACGGAACA 


TTTTGAAATG 


7980 


ATAAAAGAAG 


CATTTCTTTT 


CTTTGAAGAT 


TATTTATTTG 


AGGTGGATGG 


CTACTTGATG 


8040 


ACAGGTCCAA 


GTGTCTCACC 


GGAAAATAAA 


TATCGCTTAA 


AAAATGGTAT 


TGAAGGAAAT 


8100 


GCTTGTCTAT 


CAT C T AC AAT 


TGATAATCAA 


ATTCTAAGAT 


ATTTTTGTGA 


TTCATGCATT 


8160 


GGCATTGCAA 


AACAATTAGG 


AG AC AAT TCG 


GATTTTATTA 


GTCGTGTGAA 


GGAGTTAAAA 


8220 


AAGAAACTAC 


CTAAAACAAA 


AATAGGTAGT 


AATGGGCAAA 


TCCAAGAATG 


GTTAGAAGAT 


8280 


TATGAAGAAG 


TAGAGCCTGG 


GCATAGACAC 


ATTTCACCTC 


TATTTGGGCT 


TTATCCTTAT 


8340 


AATGAG AT T G 


ATATTCATAA 


AACTCCGGAA 


TTAGCAGAAG 


CAGCTAAAAT 


C AC TAT C AAT 


8400 


AGGAGATTAT 


CAAACGCTAA 


TTTTTTATCT 


TCACAGGAGA 


GGGAGCAAGC 


GAT T AAT AAT 


8460 


TGGTTAGTAA 


GTGGTTTGCA 


TGCTAGTACA 


CAAACAGGTT 


GGAGTGCTGC 


ATGGCTGATT 


8520 


CATTTTTTTG 


CGAGACTATA 


TCAAGGTGAA 


CCTGCTTATA 


ACCAGATTAA 


TGGTTTGTTA 


8580 


AATAATGCGA 


CTCTTGGCAA 


TTTATTTCTT 


GACCATCCAC 


CATTTCAAAT 


TGATGGTAAT 


8640 


TTAGGTTTGG 


TGAGTGGAAT 


TTGTGAATTA 


TTAGTACAGA 


GCCATCATAA 


TTGGTTATCA 


8700 


CTAATTCCAG 


CTTTACCTTC 


TGCTTGGTCA 


GAAGGAGAAG 


TGAAAGGTTT 


CAGAGTAAGA 


8760 


GGAGGATATA 


AGGTATCGTT 


TGCTTGGAAA 


AATGGGGATA 


TAACATTCCT 


AAAATTGGAA 


8820 


GGAGGAAACA 


AAGATCAAAA 


AGTAAGAGTA 


AGAATATATG 


G C AAAAAT AC 


TGATGTACAA 


8880 


AATATTGAAT 


TGGTATTTAA 


TTCAGAAAAA 


ATTATTGAGT 


TAAATTTTTA 


GGTATAAGTC 


8940 


ATGAATAAAG 


AAAAAATAAA 


AAGAAAATTA 


ATCACAATAT 


TGTTTGTATG 


TATTGGGATG 


9000 


TTATGTTTTG 


GATTGTTAGC 


AGGAGTTAAG 


GCTGATAATC 


GTGTTCAAAT 


GAGAACGACG 


9060 
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ATTAATAATG 


AATCGCCATT 


GTTGCTTTCT 


CCGTTGTATG 


GCAATGATAA 


TGGTAACGGA 


9120 


TTATGGTGGG 


GGAACACATT 


GAAGGGAGCA 


TGGGAAGCTA 


TTCCTGAAGA 


TGTAAAGCCA 


9180 


TATGCAGCGA 


TTGAACTTCA 


TCCTGCAAAA 


GTCTGTAAAC 


CAACAAGTTG 


T ATT C C ACG A 


9240 


GATACGAAAG 


AATTGAGAGA 


ATGGTATGTC 


AAGATGTTGG 


AGGAAGCTCA 


AAGTCTAAAC 


9300 


ATTCCAGTTT 


TCTTGGTTAT 


TATGTCGGCT 


GGAGAGCGTA 


ATACAGTTCC 


TCCAGAGTGG 


9360 


TTAGATGAAC 


AATTCCAAAA 


GTATAGTGTG 


TTAAAAGGTG 


TTTTAAATAT 


T G AGAATT AT 


9420 


TGGATTTACA 


ATAACCAGTT 


AGCTCCGCAT 


AGTGCTAAAT 


ATTTGGAAGT 


TTGTGCCAAA 


9480 


TATGGAGCGC 


ATTTTATCTG 


GCATGATCAT 


GAAAAATGGT 


TCTGGGAAAC 


TAT TATG AAT 


9540 


GATCCGACAT 


TCTTTGAAGC 


GAGTCAAAAA 


TATCATAAAA 


ATTTGGTGTT 


GGCAACTAAA 


9600 


AATACGCCAA 


TAAGAGATGA 


TGCGGGTACA 


GATTCTATCG 


TTAGTGGATT 


TTGGTTGAGT 


9660 


GGCTTATGTG 


ATAACTGGGG 


CTCATCAACA 


GATACATGGA 


AATGGTGGGA 


AAAAC AT TAT 


9720 


ACAAACACAT 


TTGAAACTGG 


AAGAGCTAGG 


GATATGAGAT 


C C T AT GC AT C 


GGAACCAGAA 


9780 


TCAATG AT TG 


CTATGGAAAT 


GATGAATGTA 


TATACTGGGG 


GAGGCACAGT 


TTATAATTTC 


9840 


GAATGTGCCG 


C GT AT AC ATT 


TATGACAAAT 


GATGTACCAA 


CTCCAGCATT 


TACTAAAGGT 


9900 


ATTATTCCTT 


TCTTTAGACA 


TGCTATACAA 


AATCCAGCTC 


CAAGTAAGGA 


AGAAGTTGTA 


9960 


AATAGAACAA 


AAGCTGTATT 


TTGGAATGGA 


GAAGGTAGGA 


TT AG T TC ATT 


AAACGG AT T T 


10020 


TATCAAGGAC 


TTTATTCGAA 


TGATGAAACA 


ATGCCTTTAT 


AT AAT AAT G G 


GAGATATCAT 


10080 


ATTCTTCCTG 


TAATACATGA 


GAAAATTGAT 


AAGGAAAAGA 


TTTCATCTAT 


ATTCCCTAAT 


10140 


GCAAAAATTT 


TGACTAAAAA 


TAGTGAGGAA 


TTGTCTAGTA 


AAGTCAACTA 


TTTAAACTCG 


10200 


CTTTATCCAA 


AACTTTATGA 


AGGAGATGGG 


TATGCTCAGC 


GTGTAGGTAA 


TTCCTGGTAT 


10260 


ATTTATAATA 


GTAATGCTAA 


TATCAATAAA 


AATCAGCAAG 


TAATGTTGCC 


TATGTATACT 


10320 


AATAATACAA 


AGTCGTTATC 


GTTAGATTTG 


ACG CC AC AT A 


CTTACGCTGT 


TGTTAAAGAA 


10380 


AATCCAAATA 


ATTTACATAT 


TTTATTGAAT 


AAT T AC AG G A 


CAGATAAGAC 


AGCTATGTGG 


10440 


GCATTATCAG 


GAAATTTTGA 


TGCATCAAAA 


AGTTGGAAGA 


AAGAAGAATT 


AGAGTTAGCG 


10500 


AACTGGATAA 


GCAAAAATTA 


TTCCATCAAT 


CCTGTAGATA 


ATGACTTTAG 


GACAACAACA 


10560 


CTTACATTAA 


AAGGGCATAC 


TGGTCATAAA 


CCTCAGATAA 


ATATAAGTGG 


CGATAAAAAT 


10620 


CATTATACTT 


ATACAGAAAA 


TTGGGATGAG 


AAT AC CC ATG 


TTTATACCAT 


TACGGTTAAT 


10680 


CATAATGGAA 


TGGTAGAGAT 


GTCTATAAAT 


ACTGAGGGGA 


CAGGTCCAGT 


CTCTTTCCCA 


10740 


ACACCAGATA 


AATTTAATGA 


TGGTAATTTG 


AATATAGCAT 


ATGCAAAACC 


AACAACACAA 


10800 


AGTTCTGTAG 


ATTACAATGG 


AGACCCTAAT 


AGAGCTGTGG 


ATGGTAACAG 


AAATGGTAAT 


10860 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA AT AG AAAC G A AGTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 1122 0 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 112 80 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT G ACT CAT AC T 1134 0 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 114 60 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT T AAAAGT AC C TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAAGTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 117 00 

TTTTCTCGTC TAG C AGT TG A TGGAAATAAA AACGG AG AT T ATGGTCATCA TTCAGTGACT 117 60 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AG AG ATT AT C AAATTTTGAT 118 80 

ATTATTATAT ATGATTCAAA TG AT TAT G AA GTTTTTACAC AACATATTGA CAGTTTAGAA 1194 0 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTTCTTTG 12 000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12 0 60 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 1212 0 

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 1224 0 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 12 3 00 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT T AC AC C AT GC TGGTGTTATG GTAGTGAAAC 1242 0 

T ATGG AT AT G TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12 540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GAT ATT C C AG AAGATGTCAA 12 600 
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AG AAAAACT T 


TTACGCTATG 


CGCGTGCAGC 


TCTTGCAACT 


GGCTTGATGA 


GAGACACTGC 


12660 


TTACCTATCA 


ATGGGTAGTG 


TTTCGATGGG 


GATTGGTGGT 


TCTATTGTAA 


ATCCGGATTT 


12720 


CTTCCAAGAA 


TACTTAGGAA 


TGCGAAATGA 


ATCGGTAGAT 


ATGACGGAGT 


TCACGCGCCG 


12780 


TATGGACCGT 


GGTATTTACG 


ACCCTGAAGA 


GTTCGAACGT 


GCGCTCAAAT 


GGGTGAAAGA 


12840 


AAACGTAAAA 


GAAGGATTCG 


AC C AT AAC C G 


TGAAGACCTT 


GTTTTAAGCC 


GTGAAGAAAA 


12900 


AGATAGACAA 


TGGGAATTTG 


TTATTAAGAT 


GTTCATGATT 


GGACGTGACT 


TAATGGTTGG 


12960 


TAACCCAAGA 


CTTGCTGAAC 


TTGGTTTTGA 


GGAAGAAGCG 


GTTGGTCACC 


ATGCTTTAGT 


13020 


AGCTGGTTTC 


CAAGGTCAAC 


GTCAGTGGAC 


AG AC CAT T T T 


CCAAATGGGG 


ACTTTATGGA 


13080 


AACTTTCCTC 


AATACTCAGT 


TTGACTGGAA 


TGGTATTCGA 


AAACCATTTG 


TATTTGCGAC 


13140 


AGAGAATGAT 


TCACTAAATG 


GTGTGTCTAT 


GCTCTTTAAT 


T AT CT ATT AA 


CAAATACTCC 


13200 


ACAAATCTTT 


GCTGATGTGC 


GTACTTATTG 


GAGCCCAGAG 


GCTGTTAAAC 


GTGTAACGGG 


13260 


ACATACTTTA 


GAGGGTCGTG 


CTGCAGCTGG 


CTTCTTACAT 


CTAATCAACT 


CTGGTTCTTG 


13320 


TACATTGGAT 


GGTACAGGTC 


AAGCTACTCG 


AG ATGG C AAA 


CCTATTATGA 


AACCATTCTG 


13380 


GGAGTTGGAA 


GAAAGTGAAG 


TGCAGGCTAT 


GCTTGAAAAT 


ACAGACTTCC 


C AC C AG C AAA 


13440 


CCGCGAATAC 


TTCCGTGGAG 


GAGGATTCTC 


AACTCGTTTC 


TTGACGAAGG 


GGGATATGCC 


13500 


AGTAACAATG 


GTACGTCTCA 


ATCTTCTAAA 


AGGGGTTGGT 


CCAGTGCTAC 


AAATTGCAGA 


13560 


AGGTTACACA 


CTTGAACTTC 


CTGAAGATGT 


TCACCATACT 


TTAGATAATC 


GTACAGATCC 


13620 


AGGATGGCCA 


ACTACTTGGT 


TTGCTCCACG 


TTTGACAGGA 


AAAGGTGCTT 


TCAAGTCTGT 


13680 


CTATGACGTC 


ATGAATAATT 


GGGGAGCTAA 


TCACGGAGCC 


AT AAC AT AT G 


GACACATTGG 


13740 


AGCAGACTTG 


ATTACCTTGG 


CTTCTATGTT 


GAGAATTCCT 


GTCAATATGC 


ATAATGTACC 


13800 


TGAGGAAGAT 


ATCTTTAGAC 


CTAAAAATTG 


GTCCTTATTT 


GGAACAGAAG 


ATCTAGAATC 


13860 


AGCAGACTAT 


CGTGCATGTC 


AGTTGTTGGG 


GCCACTACAT 


AAATAAAACT 


TGTTTATATA 


13920 


GGAGGTGAAC 


TTACGTCCCT 


CCTATCCTTT 


TAAAAAGATT 


TGTTAAACAA 


TTCACAAATA 


13980 


AT TG AAAACG 


AATACAAAAA 


GTAATATAAT 


GATGTTAAAT 


AGATAGCGCG 


GAGGCGCAGG 


14040 


AGGAAAATTA 


TATGGCTATA 


TTTTATGTTC 


CGGCAGTCAA 


CCTTATTGGA 


AAAGGTGTTG 


14100 


TAAATGAAGT 


GGGTCCTTAT 


ATCAAGGAAC 


TTGGCTATAA 


AAAGGCACTT 


TTGGTGACAG 


14160 


ATAAGTACAT 


CGAAGGCAGT 


GATATTTTAC 


CTAAGACTTT 


AAAACCACTG 


GATACAGAAG 


14220 


GAATCGAATA 


T 










14231 


(2) INFORMATION FOR SEQ ID NO: 82: 









(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 169 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

AGTTCTCTTA ACTTTTTTAG GATGGCATTC TCCGCTCTCA GG TACT CAT T TTCTGCTgAA 60 

GACGTTCTAA TTCTGTCCTC TCTTCAGGTC TCGTTTTTGG CTTACGTCCC ATTTTAGGTA 12 0 

CTCTCCCTCT TGTTTTCTCA ACAATAGTAT ACCCGTTTTT CCTGTATTGT GCTAGCCAGT 180 

TAAGAAGTAT CGTACGACTT GGGAGACCGT ATTCAAGAGA AACTCTATCT TTAGTCCAGC 24 0 

CTTCATGTCA GACTTTATTA CTCATTTCTT GTTTTAAATC AGGAGAATAG TAACGATTTT 300 

TTCCTTTTTT GACGAACTCT ATTCCGTAAC GATCAATCAA TTTAATCATG TACCTAATAT 3 60 

TAGAATTGCT TATCCCAAAT TTATTTGAAA GCTTCTCTAA GCTATATCCT TGTTTTCTAA 42 0 

GTTCATAGAT CTGAACTTTA TCATCATAAG TTAGTTTCAT AATAAAAACA CCCCAAAAGT 4 80 

TAGATTTTTT CTGTCTAACT TTTGGGGTGT AGTTCATGTA CACCTGATAT GATGCGTTTT 540 

ATAATTTTTA AGCCTTTTTG CCCAGCCTCG TCAAAAGTAA TGTTTTGACA CAAAATCTGT 600 

GACAAAACTT TAGTTTTAAA GGTTTTTAAC TTTGTATATA CTAGTTTTAA GAAAAGGAGG 6 60 

ATGATCTAAT GGAAGAAAAA GTATCATTGA AAGTCAGGGT TCAAAAACTA GGGACATCGC 72 0 

TTTCAAATAT GGTTATGCCC AATATTGGAG CATTTATTGC TTGGGGAGTA TTGACTGCCC 7 80 

TCTTTATCGC TGATGGCTAT CTGCCAAATG AACAGTTAGC TACTGTTGTT GGTCCTATGT 840 

TAACGTATTT ATTGCCAATC CTGATTGGTT ACACAGGTGG AT AT AT G ATC CATGGCCAAC 9 00 

GTGGTGCCGT TGTAGGAGCT ATTGCTACTG TTGGTGCAAT CACAGGTTCT AGTGTTCCTA 9 60 

TGTTTATCGG AGCTATGGTA ATGGGCCCAC TGGGAGGATG GACTATCAAG AAATTTGATG 102 0 

AG AAGTT C C A GGAAAAAATT CGTCCCGGAT TTGAAATGTT AGTTAATAAC TTCTCAGCTG 1080 

GTCTCGTTGG TTTTGCATTA TTGCTTTTGG CTTTCTACGC AATCGGTCCA GTCGTATCGA 114 0 

CTCTTACTGG AGCTGTTGGG AATGGTGTTG AGGCTATTGT CAATGCTCGC CTCCTTCCTA 12 00 

TGGCTAATAT T ATC ATC G AA CCGGCTAAAG TCCTTTTCCT CAATAATGCC CTCAATCATG 12 60 

GCATTTTTAC TCCTCTGGGA GTAGAACAGG TAGCTCAAGC TGGTAAGTCA ATTCTCTTCC 132 0 

TATTGGAAGC TAATCCTGGA CCAGGTCTGG GAATTCTATT AGCTTATGCT GTATTCGGTA 1380 

AAGGTTCTGC TAAATCTTCT TCTTGGGGGG CAATGGTTAT TCATTTCTTC GGAGGGATTC 1440 

ATGAAATT T A CTTTCCTTAT GTTATGATGA AGCCTACTCT ATTTTTAGCT GCTATGGCAG 1500 
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GAGGTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 15 60 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 17 4 0 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATC AT T TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 18 60 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 19 8 0 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2 040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2 22 0 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTT C AG 2 2 80 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2 34 0 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 24 00 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2 460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2 52 0 

TTCAATGCAA G AT ATG ACT G AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATC AG AT TTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2 64 0 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2 7 00 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2 7 60 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA T T C AGG AT AT 282 0 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCTGAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGAT T C AG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 294 0 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3 06 0 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 312 0 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3 240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3 300 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAG AT T G A 3 3 60 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTAGG 342 0 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 3 4 80 

GAATAAATTT C T T TAT AC AG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3 54 0 

T CAT AC AG AG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3 600 

T C C AG AG ATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG T C AC TAG AG A 3 6 60 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 372 0 

TCTAGTAGAT T AC C AC AGT A TTGATCTCAG TCAGTATGAT TATATTTTAT CTACCAAGCC 378 0 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTGA 3 840 

AT T AC AGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 396 0 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 402 0 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 414 0 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTGAACTC AAACTACCTA TCTCCGCATT 4200 

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TGTCTAAAGA 42 60 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AG CT ATT AGT CAGTCGATTA TTGAAAATCA 43 2 0 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4380 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4440 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4 500 

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAG 4560 

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 62 0 

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4 6 80 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4 740 

ACT T AG AAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4 800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4 860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920 

AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA AT C AG AT AAT TCATGCTCTG 5040 



WO 98/18931 



PCT/US97/19588 



654 

AATGAAAAGG GCAAGTATGA AATTGAAATT GCACAGAAAG GACAGTCTCG TATAGAAGTA 5100 

ACTAATGTGG CTGGCATTAA TAGCAAAGAA CAT C C TG AG C AAGTCATTGA AGCGATTCAA 5160 

AAGACGGATA T TAT TACT AC TGCAATCGGA CCTAATATAC TCCCTTTTAT CGCCGAACTT 522 0 

CTAGCCAAAG GAATCGAAGC TCGCCGAGTT GCAGGAAATA CACAGGCATT GGATGTTATG 52 80 

GCCTGTGAAA ATATGATTGG CGGGTCTCAA TTTCTTTATC AAGAAGTCAA GAAATATTTA 5340 

AGTCCGGAAG GTTTGACATT TGCTGATAAC TACATAGGTT TTCCAAATGC TGCAGTAGAC 54 00 

AGGATTGTTC CAGCACAAAG TCACGAAGAT TCCCTTTTTG TTGTGGTCGA GCCCTTTAAT 54 6 0 

GAATGGGTCG TGGAAACCAA GCGTCTTAAA AATCCAGATT TACGTCTAAA AGATGTGCAT 5520 

TATGAAGAAG ATTTAGAACC CTTTATTGAG CGAAAACTTT TTTCAGTCAA TTCTGGACAT 5580 

GCAACTTCAG CTTACATTGG TGCGCATTAT GGTGCCAAGA CAATTTTGGA AGCTCTTCAA 5640 

AATCCTAATA TTAAATCTCG GATTGAATCT GTATTAGCTG AAATTCGGAG TCTCTTGATT 5700 

GCCAAATGGA ACTTTGATAA AAAAGAATTG GAGAATTATC ACAAAGTCAT T AT AG AAC G A 57 60 

CTTGAAAACC CTTTCATAGT GGACGAGGTT AGTCGCGTAG CTCGTACTCC AATCCGAAAA 5820 

TTAGGCTATA ATGAACGATT CATCCGGCCG ATACGTGAAT TGAAAGAACT CAGTTTGTCA 5880 

TATAAAAACC TACTTAAAAC AGTTGGCTAT GTCTTTGACT ATCGCGATGT AAATGATGAA 594 0 

GAAAGTATTC GATTAGGTGA ATTGTTGGCT AAACAATCAG TCAAAGATGT TGTTATACAA 6000 

GTTACAGGTT TAGACGACCA AGAATTGATT GAGCAAATTG TAGAGTATAT TTAATCTTTT 6060 

TCGAAAATCT CTTCAAATCA GGTTAGCATC GCTTTGTCTT AGGCATATGT TGTTCTATCT 612 0 

ACAACCTCAA AGCAGTGCTT TG AG C TG AC T CCGTCAGTCT TATCTGCAAT CTCAAAACAC 6180 

TGTTTGAGTT ATCTGCGGTA ATCTTTCTAG CTTGTCTTTG ATTTTTGTTG TTATTTATAA 624 0 

GGTAAAAGAA GCTGGACAAA AAGTCTTCAA AATCGGGAAA AGGCAGCCTA TCGGGTGTTC 63 00 

AAAAATCTTG ATAGGATGTC CTTTATTATG GAAAGCCTTA TTGGATTTTC TCCTCAGATT 63 6 0 

GAGTTTTTGA TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGT AGC C CAT CATGTTATTT 642 0 

TTATGGACAG TGGGAAAATT GTTGAAAAAA ATAATGCCCA TCAATTCTTT AGTCGTCCAA 64 80 

GAGAAGAACG AACCAAGCAA TTTTGGAACG AATTCTTTCG AATGCGATCT ATATAGTAAA 6540 

ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA AAATGTTTTA 6600 

GAAGTAGAGG TGT ACT ATT C TAGTTTCAAT CTACTATATA AC TG AAAAAT TAGATAAATT 6660 

AGTTTTGGAA AATGACTAAC CAAAAGATAT CCAAAGTAGT CTAAAATTGT CTATACTTTA 6720 

TGAGTGTTTT AGTTAGGAAA AAGGCTTGTT GTCTATAATT GTCTGCATTA GTCTAGATTT 67 80 

TATTTATAGA AAATGTTATA ATAGACTGTA TTTAAAAAAT TTTAAGGAGA AATGACAGAA 6840 
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TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6 900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 6 960 

TTCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7 02 0 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7 080 

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 72 00 

ACAAAAACCT TGAAGT AT C A GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 72 60 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7 32 0 

AAAACGGCGA CACTGTTGTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7 380 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TCAATTCATC CCTGGTTTCG 7440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA ACATTCCCAG 7 5 00 

AAG ACT AC C A AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTGACA ACT AT CC AC G 75 60 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC AT TG AT G AAG 7 62 0 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7 680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 7740 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7 8 00 

TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7 8 60 

CT ACT C AAG A AGACCTTCAC AACCAATACC AAG C AG AAG C TGAGTCACGT ACTAAGACTA 7 92 0 

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7 980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT AC AAC AT GG A AGTTGCACAA GTTCAAAACT 804 0 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 8100 

C AAGC AC AG C AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 822 0 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 82 8 0 

TGACTGTTTC AG AG AC AG CT TGTTGCTTTT CT T C AAC ATT CTCCAGTAGA T C AC TG AAG C 8340 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 84 00 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 8460 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8 52 0 

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC C ATT CGTT AC 8580 
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GGGGGTAAGA GACTTCATTA C CAT C AAAAT CAATGACAAT CTCGTCTTGT TTCGATTCGG 8640 
TGTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC CCCATTAAAG ACATTGATTT 87 00 

CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG AT AGT G AC AC TGAGGAGCTT 87 60 

CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTGCATGAGC TGATTGATAG 8 820 

CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC TTGGATATCA CGGGCGGGAA 8880 
TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 8940 
CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA ATCAGCTGGC AAGATGCCCT 9000 
GTCGAATCTG ACTAGCTAGG GTGACGATGG TTGATTCTTT GCTTTGTCGA TAAATTTTTT 9060 
CCAAGCGAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 9120 
TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGAGATAT 9180 
TGGAGAAGAG TTGATTGGCC AGCCAAGTAT CTACCATAGA GAATTCATCC ACGATGATAA 92 40 

AGTCAGCATC TAGGTAATCT TCCAGATGAC TGGT AT CAT C GTCACCTGTC ATTCCCAAGT 9300 
GGCGATGTAT GGTCGCGCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 93 60 

CAGTTGGAGC AGCAAGAAGA ATGGGCAGAT TGCTTTTCTT CCTGAAGTCA AGTCCTTCTA 9420 
AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT C TT AC C AGT A CCAGGCCCAC 94 80 

CTGTCAGGAT AAAGACCTTA TTCTGGATAG CATCACAGAT AGCCTGTTTT T G AATGTT AT 9540 
C AT AC TCAAT TCCCAGTTCT TGCTCGACAG T AGT GAT AT G TTTTTGAATG GTTTCTAAAT 9 600 

CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATGCCTTCCT 9660 
CAGCGAAAAA GAGGCTGTTG TCAAAGATCT TGGTATCAAT CTGCTGAACC TTGTCTTCTT 9720 
CGATCAGGTA GGAGAGCTCT TGGGCAACTT GGCTGGGGTC TAGTTCCACG GGACGGGAAG 9780 
ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 9840 
TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCGGAAGCGT TCAGGAGCCT 9900 
GACTTTCGAT GCCTAGTTCC TCAGCTAGTT GGTCAGCAAT GGTAAAGCCC AAACCCTTGA 9960 

TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 1002 0 

AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 10080 

TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 1014 0 

AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 10200 

CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 10260 

TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 1032 0 

GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC C C AAAAAGT A TAGTCTTCGC 103 8 0 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 10440 

CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTC C AT AAA ATTCCTTTGC ATGAATAGGT 10560 

GAGAGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10 62 0 

CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 108 6 0 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 1092 0 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 112 20 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 112 80 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 114 00 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 114 60 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT C C ACTTGGG A GCTGATAACC 115 80 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTG CG AC A G AC AAGTAT T TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 117 00 

ACGATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 117 60 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAG AT AG C CT GATTATGGAG 1182 0 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 1194 0 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 12 000 

GTCAGGTGTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 120 60 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 1212 0 
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AAAG AAAC TG GCGTATTTTT CAGCCCCTTC ACCCTGAAGC AAGATTTTTC CAGAAGTATA 12180 

G AT AG AAAC C GTTGCTTGAG GTAGTTTCAA AAAGTAGCGG ATATAGGGAT TCTTGCTAGG 12240 

AG C C AG ACTG GTTTGATAGT GTTCAAGAAA AGCCTGAATA TCCTTTTCGC TTGGTGTGAG 12 300 

TGTGATACTT GCCATAGTTT CTATTGTACC ACAAAAGCAG TAAAATTTGT AAAAAC TG AC 12 3 60 

AAAATTAGCG AATTTTGGTA TAATATCGTG AGGTGAATTT TATGGCAAAT CTAAATCGAT 12 42 0 

TCAAATTTAC ATTCGGGAAA AAATCGTTAA CCTTGACAAG CGAACATGAC AACCTTTTTA 124 80 

TGGAGGAAAT CGCTAAGGTT GCGACAGAAA AATACCAAGC AATTAAAGAA CAAATGCCTA 12 540 

GCGCAGATGA TGAAACAATC GCTCTTTTGT TGGCAGTCAA CTGTTTATCA ACTCAGCTCA 12 6 00 

GCCGTGAGAT TGAATTTGAC GATAAGGAGC AAG AG C TAG A AGAACTCCGT CACAAGCTTG 12 660 

TGACTTGTAA G C AAG AAC AG AGCAAGATTG AGGATTCCTT ATGATTTCAT TCCTTCTTCT 12 72 0 

ATTGGTCTTG GTTTGGGGAT TTTATATCGG CTATCGGAGA GGCCTGCTCT T AC AGGTT T A 12 7 80 

TT AC C T GAT T TCAGCCATGG CATCGGCTTT TATGGCTGGC CAGTTTTATA AGGGGCTTGG 12 840 

AGAGCAATTC CATTTATTGC TCCCTTATGC AAATTCGCAG GAAGGTCAGG GGACTTTCTT 12 9 00 

TTTCCCATCG GATCAACTCT TTCAGCTGGA TAAGGTCTTT TATGCAGGTA TCGGCTACTT 12 960 

GCTTGTATTT GGGATTGTCT ATAGCATTGG TCGTTTACTT GGTCTTCTCT TACACTTGAT 13020 

TCCTAGCAAA AAACTGGGTG GTAAGTTGTT CCAAGTTTCA GCAGGTATCT TGTCCATGTT 13 080 

GGTGACCTTA TTTGTCTTGC AAATGGCCTT GACAATCTTG GCGACCATCC CCATGGCAGT 13140 

TATACAAAAT CCTCTTGAAA AGAGTATCGT CGCAAAACAC AT CAT C C AG A GCATACCGGT 132 00 

AACAACCAGT TGGCTCAAAC AAATCTGGGT GACAAATTTA AT C GG AT AAA AAGGGCAGGA 132 60 

GTTTTCCTAG CCCTTTGTTT ACAGATTTGA CTCGAATCTA TCAGAATGTA AAAAGCTACC 13320 

AC AC CT AG AC ATTCAAAGAC AAGGAAATAA AGATGAATAA G AAAAT AT T A G AAAC AT TAG 133 80 

AGTTCGATAA GGTCAAGGCC TTGTTTGAGC CTCATTTGTT GACCGAGCAG GGCTTGGAGC 1344 0 

AATTGAGACA ACTGGCTCCG ACT G C C AAAG CAGAT AAAAT CAAACAGGCT TTTGCTGAGA 13 500 

TGAAGGAAAT GCAGGCTCTT TTCGTCGAGC AACCGCATTT TACTATTCTC TCAACTAAGG 13 5 60 

AAATTGCAGG AGTCTGCAAG AGGTTGGAGA TGGGAGCGGA TCTCAATATC GAGGAGTTCC 13 62 0 

TACTCTTGAA ACGCGTGCTT CTTGCCAGCC GAGAACTTCA AAATTTTTAC ACCAATCTGG 13 680 

AAAATGT C AG CTTGGAAGAA TTAGCCCTTT GGTTTGAGAA ATTACATGAT TTTCCGCAAT 13 74 0 

TACAAGGAAA TCTTCAGGCC TTTAATGATG CGGGTTTCAT TGAAAATTTT GCCAGTGAAG 13800 

AATTGGCGCG AATCCGTCGA AAAATACATG AT AG CG AG AG TCAGGTACGC GATGTTTTAC 13 8 60 

AAGACTTGCT CAAGCAAAAA GCGCAGCTGT TGACGGAAGG AATTGTTGCT AGCAGAAATG 13 920 
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GCCGTCAGGT TTTACCAGTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13 9 80 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTG AG C G 14040 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC GCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 1422 0 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 142 80 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 143 40 

ATACAGGTGG G AAGACC AT C ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 144 00 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCGTGTTGG TATTTTTGAA GAAATCTTTG 144 60 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATGA 14520 

CCAATATCGT GGATATTCTT GGCAAGGTCA AC C AAC AT T C ACTCTTACTT TTGGATGAGT 145 80 

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14 64 0 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACG 147 00 

GTATTGAGAC AGCCTTTGTG C AAAATGC C A GTATGGAGTT TGATACTGCA ACTCTTCGCC 147 60 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14 82 0 

GTCTAGGCCT ATCTGAAGTT AT CGT AGG AG ATGCCAGTCA GC AG AT CG AT CAGGACAATG 14880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACGTTTGG 14940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTGCTG 15 060 

AG ATTGTG G A TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAAGAAAAAA CGAGCTCCAA 152 40 

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15 360 

AGTTTGATCT TGTTCAAGCC C AG C AAG AAA AACCAGTCAA GAAGAAACAG GTCAATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 154 80 

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15540 

AAGTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600 

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT GGAGGCAGTG 15660 
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GTGCGACTAT TGTCACTTTT AAAGGATAGC AGTATTCTGG ACTTTATAAA GTAAAAACTG 1572 0 

TTGAACTAAT TTTTACTAAT AAAC AC AT T G AC AAAAG CCA ACATTTTTTG T AAAAT TAG A 15780 

AT C AAT T AAA TACCAACACC GAATGAAGTT TAATAGAAGT GGGGAATCGT TTGATTTTCC 15 840 

ATGACTGTAA ATGGACGGAA CTCTGGAGAG ACCGTAAAGG CACCGAAGGG CAAGGCAGGC 15900 

AACTGCTCAA ACTCTCAGGT AAAAGGACAG AGCTAGGATA GACCGCTTTT TAGCATTTAT 159 60 

CTAAGCATTC CAGAGTACAT GTATCTTGCA TGTGCTCTTT CTTTTGGGGT TGAAACGATA 16020 

GGAGAAGGAA ATGTTAGAAT TGCTTAAATC AATCGATGCT TTTGCTTGGG GACCGCCCCT 1608 0 

CTTGATTTTA TTGGTCGGAA CAGGGATTTA CCTAACTATT CGGCTAGGAC TCTTGCAGGT 16140 

TTTGCGTCTA CCCAAGGCCT TTCAGCTTAT TT T TAT C C AG GATAAGGGAC ATGGTGATGT 162 00 

ATCCAGTTTT GCAGCTCTGT GTACAGCCTT GGCATCAACT GTTGGAACAG GAAATATCAT 16260 

AGGAGTTGCG ACGGCTATCA AGGTTGGTGG ACCAGGAGCT CTATTTTGGA TGTGGATGGC 16320 

GGCTTTCTTT GGAATGGCTA CCAAGTATGC GGAAGGACTC TTGGCCATCA AATACCGCAC 163 80 

CAAGGACGAC CATGGTGCAG TAGCGGGAGG TCCCATGCAT TATATCCTTC TAGGGATGGG 16440 

AGAAAAGTGG CGACCACTTG CTGTTTTGTT TGCAGTAGCA GGAGTATTGG TTGCTCTCTT 16500 

GGGAATCGGA ACCTTCACCC AAGTCAACTC GATTGCAGAA TCTATCCAAA ATACAACGAC 16560 

GATTTCGCCA GCCATCACAG CTCTCGTCTT GTCTGTCTTT GTAGCGATTG CAGTCTTTGG 1662 0 

TGGACTCAAG TCTATTTCTA AGGTTTCAAC TACTGTTGTT CCTTTTATGG C CAT CAT T T A 16680 

TATCTTAGGA AC TCTT AC AG TTATTTTCTT TAATATCGGA AAAATCCCTG GCACAATCGC 1674 0 

TTTAGTCTTT ACCTCAGCTT TTAGTCCCCT TGCTGCGGTA GGTGG AT TTG CTGGTGCTAG 16800 

CGTTCGGATG GCTATTCAAA ATGGTGTGGC GCGTGGTGTG TTCTCAAACG AATCTGGTCT 168 60 

GGGTTCTGCT CCTATTGCAG CTGCAGCTGC CAAGACAAAT GAACCAGTAG AGCAAGGTTT 16 920 

GATTTCCATG ACAGGAACCT TTATTGATAC CCTCATCATT TGTACTCTAA CTGGTTTGAC 16980 

CATCTTGGTA ACTGG 16995 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 60 
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ACTAAGCGCT TTAATAAACA GT TAG AT AAA ATTCAAGTTT CGTTGATTCG TCAGTTTGAC 12 0 

CAGGCTATTT CGGAGATTCC TGGGGTCTTG CGTTTGACCT TGGGGGAACC TGATTTTACA 180 

ACGCCAGACC ATGTCAAGGA GGCGGGCAAG CGAGCGATTG AT C AG AACC A ATCCTACTAT 24 0 

ACAGGGATGA GTGGTCTGCT GACTCTACGT CAGGCAGCCA GTGACTTTGT TAAGGAAAAG 3 00 

TACCAACTGG ACTATGCTCC TGAAAATGAA ATCTTGGTTA CAATTGGGGC GACAGAGGCT 3 60 

TTATCTGCGA CTTTGACGGC TATTTTGGAA GAGGGAGACA AGGTACTTTT GCCAGCTCCT 42 0 

GCTTATCCAG GCTATGAACC GATTGTTAAC TTAGTTGGGG CAGAAATTGT T GAG AT TG AT 4 80 

ACGACTGAAA ATGGTTTTGT CTTGACTCCT GAGATGTTGG AGAAGGCCAT TTTGGAGCAG 540 

GGTGATAAGC TCAAGGCGGT TATTCTCAAC TATCCAGCCA ATCCGACAGG AATTACCTAC 600 

AGTCGAGAGC AGTTAGAGGC CTTGGCAGCT GTTTTACGCA AGTACGAAAT TTTTGTTGTC 660 

TGTGATGAGG TTTACTCAGA ATTGACCTAC ACAGGCGAAG CCATGTGTCT CTAGGAACGA 72 0 

TGTTGAGAGA CCAGGCTATT ATT AT C AATG GTTTGTCTAA ATCGCATGCC ATGACAGGTT 780 

GGCGTTTGGG GCTGATTTTC GCTCCTGCGA CCTTCACAGC CCAGTTAATC AAGAGTCACC 840 

AGTACTTGGT CACTGCCGCA AATACCATGG CGCAACATGC TGCGGTAGAA GCCTTGACGG 900 

CTGGTAAAAA CGATGCGGAC CCATGAAGAA GGAATATATC CAACGTCGGG ACTATATCAT 9 60 

CGAAAAAATG ACTGCTCTTG GTTTTGAGAT TATCAAACCA GACGGTGCCT TCTATATTTT 102 0 

TGCTAAAATT CCAGCGGGCT ACAATCAAGA CTCCTTTGCT TTTCTGAAGG ATTTTGCTCA 10 8 0 

GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC AGCCTTTGGA CGTTACGGGG AAGGCTACGT 114 0 

CCGCCTATCT TATGCAGCCA GCATGGAGAC TATCAAAGAA GCCATGAAAC GACTTGAGGA 12 00 

GTACATGAGA GAAGCATGAT TCAGTCTATC ACGAGTCAAG GCTTGGTGCT TTACAATCGC 12 60 

AATTTTCGTG AGGATGACAA GCTCGTCAAA ATTTTTACAG AGCAGGTTGG CAAACGCATG 13 2 0 

TTTTTTGTCA AACACGCTGG TCAGTCTAAG CTGGCGCCTG T T AT TC AG C C CTTGGTGCTG 13 80 

G C AC GAT T TC TCTTGCGAAT CAATGATGAC GGACTCAGTT ACATCGAAGA CTATCATGAG 144 0 

GTCATGACTT TTCCCAAGAT TAATAGTGAC CTCTTTGTCA TGGCCTATGC GACCTATGTG 1500 

GCAGCTCTTG CAGATGCTAG TTTGCAGGAC AATCAGCAGG ATGCTCCCTT GTTTGCTTTT 15 60 

TTGCAAAAGA CTTTGGAGTT GATGGAAGCA GGCTTGGATT ATCAGGTTTT GACCAATATT 162 0 

TTTGAAATTC AAATTTTGAC TCGATTTGGA ATCAGCCTCA ATTTTAATGA GTGTGTCTTC 16 80 

TGCCATCGGG TTGGTCAGGC TTTTGACTTT TCTTTCAAAT ATGGAGCCTG CCTCTGTCCA 1740 

GAGCATTATC ATGAGGATAA GAGACGTTGT CATCTCAATC CCAATATCCC CTATCTGCTC 1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 18 60 

AAGCAAGAGC TACGCCAATT T AT GG AT C AA TTATATGAAG AGTACGTTGG GATTCACCTA 192 0 

AAATCAAAGA AATTT AT T G A TTCCCTAGCA GACTGGGGAC AAT TACT AAA AGAGGAAAAG 19 80 

AAATGAAAAA AATCGCAGTA GAT G C C AT GG GGGGCGATTA CGCACCTCAG GCCATTGTTG 2 04 0 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT C AG AT AT C G A GGTTCAACTT T ACGG AG AT G 2100 

AAGCTAAAAT CAAGCAATAT CTG AC AG C G A CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 222 0 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 22 80 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340 

GT C C T GG ACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2 400 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 24 60 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2 52 0 

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2 5 80 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2 64 0 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2 700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TGGTCTTCGA GC G AAAC TAG 27 60 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2 82 0 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 2 94 0 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3 0 00 

TGACCGTATT GTG AC CAT T A TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCCAAAACG V 3180 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA AT AG C AAT C G GAGTTCCAAG TCAACGGAAG 32 4 0 

TAGATGGTTT T T AG AAATG A GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGG 33 00 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 3 3 60 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 3420 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA CCTATCGAAA 34 80 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3540 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3 600 
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AAAATAGAGA TGATTTTGAA ACAAAAAAGC TAATTCAAGA CGTTTCGATG CCAATTCAAG 3 660 

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTG 3 7 80 

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 384 0 

TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3 9 00 

GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 3 960 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTG GTTTAGTAGT 402 0 

TGGTATATTT GGTCCTTATA TGGATGATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 4 0 80 

GAAGAAAACA GAAAAAT CTG TTAAAAAATA AT C AAAAC T A TAAATGATGA ATCTGAATCA 4140 

AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4200 

TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTAGG AAT AG AT C AT ACCAGAGGTA 4 2 60 

GCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAAGTGTTAC 4320 

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTCCTTT 4 380 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 444 0 

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 4 500 

GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 45 60 

ACGTCGCCTT GCCGTATATA TGTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4 620 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4680 

GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 4740 

TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4 800 

TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 48 60 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT TAAGGCAGAT ATGACGCTTT 4 920 

TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4980 

TCCACTACTA TGTGGT G ACT GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 5040 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 5100 

CTCTTTTTAT GGCACCTAGT C C AG ACT AT A AGCCTCATAA GGAACAAAAA AATGGTCTGC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 5220 

CACTCTTGGT AACCGTGATT AAC AT TGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 52 80 

CCTATGTGCC AG AT C AG ATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 5340 
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TCTACATCTT CCAGCAAATC TTGTCTTACG CTCAGGAGTA TCTCTTGCTT GTTTTGGGGC 5400 

AACGCTTGTC GATTGACGTG ATTTTGTCCT ATATCAAGCA TGTTTTTCAC CTCCCTATGT 54 60 

CCTTCTTTGC GACACGCAGG ACAGGGGAGA TCGTGTCTCG TTTTACAGAT GCTAACAGTA 552 0 

T C AT CG AT G C GCTGGCTTCG ACCATCCTTT CGATTTTCCT AGATGTGTCA ACGGTTGTCA 5580 

TTATTTCCCT TGTTCTATTT T C AC AAAAT A CCAATCTCTT TTTCATGACT TTATTGGCGC 5 64 0 

TTCCTATCTA CACAGTGATT ATCTTTGCCT TTATGAAGCC GTTTGAAAAG ATGAATCGGG 5700 

ATACCATGGA AGCCAATGCG GTTCTGTCTT CTTCTATCAT TGAGGACATC AACGGTATTG 57 60 

AG AC T AT C AA GTCCTTGACC AGTGAAAGTC AGCGTTACCA AAAAATTGAC AAGGAATTTG 5 820 

TGGATTATCT GAAGAAATCC TTT ACC TATA GTCGAGCAGA GAGTCAGCAA AAGGCTCTGA 5880 

AAAAGGTTGC CCATCTCTTG CTTAATGTCG GCATTCTCTG GATGGGGGCT GTTCTGGTCA 5 940 

TGGATGGCAA GATGAGTTTG GGGCAGTTGA TTACCTATAA TACCTTGCTG GTTTACTTTA 6000 

CTAATCCTTT GG AAAAT AT C ATCAATCTGC AAACCAAGCT TCAGACAGCG CAGGTTGCCA 6 0 60 

ATAACCGTCT AAATGAAGTG TATCTAGTAG CTTCTGAGTT TGAGGAGAAG AAAACAGTTG 612 0 

AGGATTTGAG CTTGATGAAG GG AG AT AT G A CCTTCAAGCA GGTTCATTAC AAGTATGGCT 6180 

ATGGTCGAGA TGTCTTATCG GATATCAATT TAACCGTTCC CCAAGGGTCT AAGGTGGCTT 6240 

TTGTGGGGAT TTCAGGGTCA GGTAAGACGA CTTTGGCCAA GATGATGGTT AATTTTT AC G 6300 

ACCCAAGTCA AGGGGAGATT AGTCTGGGTA GTGTCAATCT CAATCAGATT GATAAAAAAG 6360 

CCCTGCGCCA GT AC AT C AAC TATCTGTCTC AACAGCCCTA TGTCTTTAAC GGAACGATTT 6420 

TGGAGAATCT TCTTTTGGGA GCCAAGGAGG GGACGACACA GGAAGATATC TTACGGGCGG 6480 

TCGAATTGGC AGAGATTCGA GAGGATATCG AGCGCATGCC ACTGAATTAC CAGACAGAAT 6540 

TGACTTCGGA TGGGGCAGGG ATTTCAGGTG GTCAACGTCA GAGAATCGCT TTGGCGCGTG 6600 

CTCTCTTGAC AGATGCGCCG GTCTTGATTT TGGATGAGGC GACTAGCAGT TTGGATATTT 6 660 

TGACAGAGAA GCGGATTGTC GATAATCTCA TTGCTTTGGA CAAGACCTTG ATTTTCATTG 6720 

CTCACCGCTT GACTATTGCT GAGCGGACAG AGAAGGTAGT TGTCTTGGAT CAGGGCAAGA 6780 

TTGTCGAAGA AGGAAAGCAT GCTGATTTGC TTGCACAGGG TGGCTTTTAC GCCCATTTGG 6840 

TCAATAGCTA GAAAGAGGAG AGGATGAAAC CAGAATTTTT AGAAAGTGCG GAGTTTTATA 6900 

ATCGTCGTTA CCATAATTTT TCCAGTAGTG TG ATT GT AC C CATGGCCCTT CTGCTTGTGT 6960 

TTTTACTTGG CTTTGCAACT GTTGCAGAGA AGGAGATGAG TTTGTCCACT AGAGCTACTG 7 020 

TCGAACCTAG TCGTATCCTT GCAAATATCC AGT C AAC TAG CAACAATCGT ATTCTTGTCA 7 080 

ATCATTTGGA AGAAAATAAG CTGGTTAAGA AGGGGG AT C T TTTGGTTCAA TACCAAGAAG 7140 



WO 98/18931 



PCT/US97/19588 



665 

GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT GGACATGCTA AAGGATCAAA 7200 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 72 60 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 732 0 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 7 3 80 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC GATTACCAGA 7440 

CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7 500 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7 5 60 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCAGTCAA TTGGAATCCC 7680 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 7740 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7 8 00 

CGAGTGAGGA TGGGGTGCTT CAT CTT AATC C T GAG AC C AG TGATTCTAGC ATGGTTGCAG 7 860 

AAGGTGCCCT ACT AG C C C AA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 792 0 

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT GTTCGCTATA 7 9 80 

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC TACTATTACA AGTATTGATG 804 0 

CGACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAG ACTAATCTAA 8100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 8220 

GTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 82 80 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATGTCAAA ACAATTGATC 84 00 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 84 6 0 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8580 

ACT C AC TTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8 640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 8700 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT AC T AC AAAAA TGATGATTTG 87 60 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 8820 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTGGTTTGCT 88 8 0 



WO 98/18931 



PCT/US97/19588 



666 

GAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8 940 

ATTATCTTGG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240 

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 9 3 00 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 93 60 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGT ATT C GTATTGTGCA AGTATATGAT 942 0 

GTATTTGACT TGGCTGAGGA CTTGTTTGCA C C T GC AG AG A AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 954 0 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCGTC ACAGGAAGCC 9 600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 96 60 

AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 972 0 

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 97 8 0 

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9 840 

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 1002 0 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 102 00 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTGGA 103 2 0 

GCTATTCGTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 1044 0 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT T C AT AT GGTA ACCAGATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACGTATGGA ACTTGGTGCC 10 560 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10 620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGGTTCAAA CAGTTGAGTC TGTAGAGACT GCTGGTGCTG AGGTTCAAAA AGGAAATGCC 10740 

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGGC AGGCGGTGTC TGTGTGGCTA TCGGTGAATT GGCAGACGGT 10860 

CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT 1092 0 

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCGTGGTTC GTCCTGAAGA TGTGGATGCC 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT G AG AC AATC G TTGACTTGGA GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT C AG AT AC C C T TACGGTTCTA 1122 0 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11280 

CGCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC TCACACCAAC TGAGGCATCT 11340 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCGGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTGCTTATGC GGTTATCGAA 114 60 

GCAACTGCTC GTTTGGTGGC TGCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 115 80 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAAGA ATT G AC C GT T CCGCCAACCT TGGCTGCCTT TGGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 117 60 

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TGTCAAATAC 1188 0 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG TGCAGAGGTG 1194 0 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGGCTTTGT CTTCACATCT 12 000 

CCTGAAGAAA TTGCTGGAGT AG AG AAGGT T GGACAAACGA AAGC AG AC T T TACACTGACT 12 060 

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 1212 0 

G AAGTTT AC C CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12180 

TCAGATGTTG TGATT AAAG C CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 12 240 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT TCGAAAAAGA AGGTGCAGAG 12300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC AGTTGAAACT 12 3 60 

ATGGT TG AC A ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 1242 0 
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GATGAACCAG ATGGTTCAGC TAAGTTTATC GTCAATATCC TGCTTAATGA AAAAGTGCGT 12 4 80 

GTGGCTATTG ATAGCTTTAT CGCCCGTGGT GGTTTGATTA TCGGTATTTG TAATGGATTC 12 540 

CAAGCCTTAG TCAAATCGGG TCTCCTACCC TACGGAAACT TTGAAGCTGC TAACAGTACT 12 600 

AGCCCAACCC TCTTCTACAA TGATGCCAAC CAACACGTGG CCAAGATGGT GGAAACTCGC 12 660 

ATTGCCAATA CCAACTCACC ATGGTTGGTT GGTGTGCAAG TGGGCGATAT CCACGCTATT 1272 0 

CCTGTTTCGC ACGGTGAAGG GAAGTTTGTC GTGACGGCTG AGGAATTTGC AGAGCTCCGT 12780 

GACAATGGAC AAATTTTCAG CCAATACGTT GACTTTAACG GTAAACCAAG TATGGATTCT 12 840 

AAGTACAATC CGAATGGTTC TGTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 12 9 00 

ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA AAATATCCCA 12 960 

GGCAATAAAG ACCAACACCT GTTCGCATCA GCGGTTAAAC ATTTCACTGG AAAATAAGAC 13020 

TT AC AG ATT T TCTAATAGAT AGTATCAGTA AT GT AAAAGT CATGTAAATC TAGCTCTTGA 13 080 

TGATTACAAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT CTTAATGAAG 13140 

AATGTGGTGT TTTCGGTATT TGGGGACATC CAGATGCTGC TAAGTTGACC TATTTTGGAC 13200 

TCCACAGTCT TCAACACCGT GGTCAGGAGG GGGCAGGAAT CCTCTCCAAT GAT C AAGG AC 132 60 

AACTGAAGCG CCATCGTGAC ATGGGGCTTT TATCAGAAGT TTTCAGAAAT CCAGCTAATT 1332 0 

TGGATAAATT GACAGGAGCT GGTGCGATTG GGCATGTGCG TTATGCGACT GCTGGCGAAG 133 80 

CTTCTGTAGA TAACATCCAG CCCTTCCTCT TCCGTTTTCA CGATATGCAG TTTGGTTTGG 13440 

CTCATAATGG AAATCTGACC AATGCAGCCT CTCTCAAGAA AGAACTGGAA CAAAGAGGAG 13 500 

CAATTTTCAG CGCGACTTCG GACTCTGAAA TCTTGGCTCA CCTCATTCGT CGCAGTCATA 13 560 

ATCCTAGCCT GATGGGCAAA AT C AAGGAAG CGCTCAGCCT TGTCAAAGGT GGTTTTGCCT 13 62 0 

ATATCTTGCT GTTTGAGGAC AAGTTGATTG CGGCTCTTGA CCCAAATGGA TTCCGACCGC 13 68 0 

TTTCGATTGG TAAAATGGCT AATGGAGCAG TTGTTGTATC TTCTGAAACC TGTGCTTTTG 13 74 0 

AGGTCATTGG TGCCGAGTGG ATTCGTGATT TGAAGCCAGG TGAGATTGTG ATCATTGATG 13 800 

ACGAGGGCAT TCAGTATGAC AGCTATACAG AT GAT AC C C A GTTGGCGGTT TGTTCTATGG 13 8 60 

AGT AT AT C T A CTTTGCTCGC CCTGATTCTA ATATCCACGG TGTCAATGTC CATACGGCAC 13 92 0 

GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTG 13 9 80 

GTGTGCCCAA TTCTTCCCTA AGCGCGGCTA TGGGATTTGC GGAAGAATCA GGCTTACCAA 14040 

ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC TTTTATCCAA CCGACTCAAG 14100 

AATTGCGGGA GCAAGGAGTG CGGATGAAAC TGTCTGCTGT TTCGGGTGTT GTCAAAGGCA 14160 

AACGTGTGGT CATGGTGGAT GATTCCATTG TACGTGGAAC AACCTCTCGT CGTATCGTTC 1422 0 
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AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTGGAAGT CCTGCACTAG 14280 

CGTATCCATG TTTCTACGGG ATTGATATCC AG AC C C GT C A GGAGCTGATT GCAGCCAATC 1434 0 

AT ACGGT C G A AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400 

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 144 60 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14 52 0 

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14 580 

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14 64 0 

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGGATCAAAG CTAGAGCCCT GACTAGTATT 147 00 

T TT AGAT AAA AAGATGGTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 147 60 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATG TTGAAGCGGG 14 82 0 

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14 8 80 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14 94 0 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTGGCTA TCAAGTACGA 150 00 

CAAGCACGAT ACC AT CGGGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060 

AGGTGCGGAA CCCCTCTATT TT CT CG AC T A CGTAGCGACA GGGAAGAATG AACCAGCTAA 1512 0 

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 1518 0 

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 15240 

TTTTGCGGTC GGTGTGGCTG AAAAATCTCA AATCATTGAC GGTTCAAAGG TGGTAGAGGG 153 00 

AGATGTTCTT CTCGGACTTG CTTCAAGTGG G ATT C AC T C A AATGGTTACT CTTTGGTTCG 153 60 

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTGGAAG GCAAGAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAG GCTGTCTTGC CGCTCATCAA 154 80 

AGAAGAGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15 54 0 

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTGATGAA AGTAAAGTTC CAGTGCTTCC 15 6 00 

AATTTTCAAA ACCCTTGAAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGT AG AG C GTGTAAAAGA 15720 

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 15780 

CAT TAT C AAA TGAAAAAAAT AGCGGTTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15 9 00 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GT T TGAGAGC 15 9 60 
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AAGGCAGACT ACGAAGCAGC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAG GCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGGCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 1614 0 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 162 00 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGC C AC G AC TAGCTGATGA T ACC ATT G AC 16260 

AG AT TTG AAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATT TT AT GT C TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16 620 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGGCT 16 800 

TACTGATTTA TCAAATTTTT ATAAGAGTGT T C AAATC AC A GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16 920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 169 80 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17 04 0 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGG AT AC C ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGG AT AG C C A CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 172 80 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT C GAT AT TGGT GGGCCATCTA 17 34 0 

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT l7400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATGAA ACTCGCCAAC 174 60 

GTTTAGCAGC CAAAGTATTT CGT C AC AC AG CGGCTTATGA CGCCTTGATT GCAGAATACT 17 52 0 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17 64 0 

C T AC AG ACT A CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 1770 0 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 177 60 
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TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 17 82 0 

C T TGGG ACT A CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 17 8 80 

ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 17 940 

TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 18000 

GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTGGAAGCA GAATACACAG 180 60 

GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 18120 

GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 18180 

GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 18240 

TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAG 18 3 00 

CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 183 60 

ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 1842 0 

GTGACCAAGA AT C CAT CG AA GCAGCGGATA AATACGGCTT G AC TAT GGTC TTTACAGGTG 18480 

TGAGAC ATT T TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 18540 

CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATG TTGGTAAATA 18600 

ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTG AG CAT 18 66 0 

GCGATTGCTA AAAAGT T ACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGGG 1872 0 

AATGATGGGA TGACTCTGGA TGGTTTGGAA TTGGTAAATA TCTCTATTTC CG AAC AT TAT 187 80 

AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 18840 

GCCCTTGCTG CTGGT AT CGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTGGTCCG 18 900 

AC T AGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGGAAAT CATGGTCAAA 18960 

TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 1902 0 

AT C G AAAAGC ATGGTGCGCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 190 80 

GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAAGCCG CTCATGAGAT GCTTTTGGAC 19140 

AATAAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AG G AGAGG AA 19200 

TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 192 60 

CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACGG GTGGTATGGG TGCCTATGCG 1932 0 

CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCGG TTGACACCAT TGTCAAGCCA 193 80 

GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 19440 

AT C C TGAC AG CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 19 500 
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ACTCAGATTA TCTTGCCTCG CTTGACCTCT GACTTTGCTC AAAAT AT C AC AG AT AT CC TG 19560 

GATAGCAAGG AGCCAAATAT CATGTGGACG GACAAGGGTG TGACTCTGGG TGTGGTTGTC 19 620 

GCATCCAAGG GCTACCCGCT AG AC TATG AA AGGGGCGTTG AGTTGCCAGC CAAGACAGAA 19680 

GGCGATGTCA TCACCTACTA TGCAGGGGCT AAGTTTGCGG AAAAT AGCAG AGCACTGCTC 19740 

TCAAACGGCG GACGAGTTTA TATGCTCGTT ACC AC AGCAG ATACCGTCAA AGAAGCCCAA 19 800 

GCCAGCATAT ACCAAGAACT ATACCAACAA AAAAT AGAAG GACTCTTCTA CCGAACAGAT 19860 

ATCGGAAGCA AGGCAATTAA GTAAAGATAT AAGAATAACG CGCCGTAGTC GCCAAACACG 1992 0 

ATAATGGTCG TCGTGGTGAA AAGACCAGAA CAGTGAATGT TCTGGTCAGG GGGAAACTTG 19980 

GAGACCTTAG GCTCAAAGTT TAGGAATGAA ACCGAAGGTT TGCTTCCGCC TCCATCACCT 2 0040 

AAG AC C ATT A TCAAAAAGAA AAATAAAAAT T C AC AAAAT A CGTTAATGAT CGTATGGTTT 2 0100 

GCGAGCGTTA GCGAGCTAAT ATAGAACAAT CACCGCCGTT GTGAAAGAAC GATTGGATGA 2 0160 

TAATCCAATC GTTCAGGGAA ATTGGAAGAC CTTGGGTTTC CAATTTAGGC ATG AG AC AC C 20220 

TTTGGTGGCT GCTGCCGTCC CTCACAAGCT AAGGTG AT T G TTGAAAAAGA GGAAAAAGGA 20280 

GAAGAAATGA AACCAGTAAT TTC CAT CATC ATGGGCTCAA AATCCGACTG GGCAACCATG 20340 

CAAAAAACAG CAGAAGTCCT AGACCGCTTC GGTGTAGCCT ACGAAAAGAA AGTTGTTTCC 20400 

GC AC AC CGT A CACCAGACCT CATGTTCAAA CATGCAGAAG AAGCCCGTAG TCGTGGCATC 2 0460 

AAGATCATCA TCGCAGGTGC TGGTGGCGCA GCGCATTTGC CAGGCATGGT AGCTGCCAAA 20520 

ACAACCCTTC CAGTCATTGG TGTGCCAGTC AAGTCTCGTG CTCTTAGTGG AGTGGATTCA 20580 

CTCTATTCTA TCGTTCAGAT GCCGGGTGGG GTGCCTGTTG CGACCATGGC TATCGGTGAA 2 0 640 

GCTGG AG CG A CTAACGCAGC TCTCTTTGCC CTCCGTCTCC TCTCTGTAGA AG AT AAGT C C 2 0700 

ATTGCGGATG C AC TTGCC AA CTTTGCTGAA GAACAAGGAA AAATCGCAGA GGAGTCGTCA 2 0760 

AATGAGCTCA TCTAAAACAA TCGGAATTAT CGGTGGCGGT CAACTGGGTC AGATGATGGC 2 0820 

CATTTCTGCT ATCTACATGG GCCACAAGGT TATCGCGCTG GATCCTGCGG CGGATTGCCC 2 08 80 

GGTCTCTCGT GTGGCGGAAA TCATTGTGGC ACCTTATAAC GATGTAGACG CCCTCCGTCA 2 0940 

GTTGGCAGAC CGTTGCGATG TCCTCACTTA TGAGTTTGAA AATGTCGACG CTGACGGTTT 21000 

GGATGCCGTT AT C AAGG AT G GACAACTCCC T C AAGGAAC A GATCTGCTCC GCATTTCGCA 21060 

AAATCGTATT TTTGAAAAGG ACTTTTTGTC AAACAAGGCT CAAGTCACTG TGGCACCCTA 2112 0 

CAAGGTCGTG ACTTCTAGCC TAGACTTGGC AGATATCGAC TTGTCGAAAA ACTATGTCCT 21180 

CAAGACTGCG ACTGGTGGCT ACGATGGTCA TGGACAAAAG GTTATTCGTT CAGAAGCAGA 21240 

CTTGGAAGCA GCCTATGCGC TAGCAGACTC AGCAGACTGC GTCTTGGAAG AATTTGTCAA 213 00 



WO 98/18931 



PCT/US97/19588 



673 

CTTTGACCTT GAGATTTCTG TCATCGTGTC AGGAAATGGC AAGGAGGTGA CGTTTTTCCC 213 60 

AGTTCAGGAA AATATCCACC GCAACAATAT CCTGTCTAAG ACCATCGTAC CAGCCCGCAT 2142 0 

TTCTGAAAGT CTAGTAGACA AGGCTAAAGC TATGGCAGTG CGAATCGCAG AACAACTCAA 2148 0 

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 2154 0 

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600 

GTTTGACACC CATATTCTGG GTGTTCTCGG AG C AC CAT T A CCAGTCATCA AACTCCATGC 21660 

GCCAGCCGTT ATGCTTAATG TCCTCGGTCA GCATGTCGAG GCTGCTGAAA AATATGTCAC 217 2 0 

AGAAAATCCA AGCGCCCACC TCCACATGTA TGGTAAAATA GAAGCAAAGC ATAATCGTAA 21780 

GATGGGACAT GTGACTTTGT TTAGTGATGT GCCGGATAGT GTGGAAGAGT TTGGGGAAGG 21840 

GATTGATTTT TAGGACAAGT C T ATG AT AC A AATTATCGTT AATACATTTA TTGAAAAGTA 21900 

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 219 6 0 

TAAATATGAA GAACTAGCTG CACAATACCC CGAAAAT T AT TT AG C TAT CT AT AATGT AC C 2 2 020 

GCTGGATACG GATTTGAATA C ACT AG AT C A TTACCCGTCT GTGTTTATTG G AAAAG AG G A 2 2 080 

GTTTGAGTAG AAATCTTGGT T T AC CT AG AT AGCTTATTCC CAACAGCTTA AGAAGAAAGG 2 214 0 

AAAAATTAAC ACATGATCAA CCGTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 222 00 

GAAAATAAAT ACCGTGCTTG GCTTGAGGTG GAAATCCTCT CTGACGAGGC ATGGGCTGAG 222 60 

TTGGGGGAAA TCCCTAAGGA AG ATGTGGC T TTGATTCGCA AGAAGGCGGA CTTTGACATC 2232 0 

GACCGTATTT TGGAAATTGA GCAGGAGACG CGCCACGATG TGGTGGCTTT CACGCGTGCG 22 3 80 

GTTTCTGAGA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 2 2 440 

GTGGTGGATA CTGCTTATGG TTACCTCTAC AAGCAGGCCA ACGACATCAT CCGTCGTGAC 22 500 

CTTGAAAACT TCACTAATAT CATCGCTGAC AAGGCCAAGG AGCACAAGTT CACCATCATG 22 560 

ATGGGGCGTA CTCATGGTGT GCACGCTGAG CCGACAACCT TTGGTCTTAA ATTAGCAACT 2 2 620 

TGGTACAGCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATGCGGCTGC TGGTGTAGAA 2 2 680 

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTGCCAATA TCCCACCATT TGTAGAGGAG 2 2 740 

TATGTCTGCG ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAAGT CCTTCCTCGT 22 800 

GACCTTCACG CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22 860 

GCGACTGAGA TTCGTGGTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 2 2 920 

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACGCA ACCCAATCGG TTCTGAAAAT 22 980 

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 23 040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC G T AT C AT C AC AC CAGATACG 23100 

ACCATTTTGA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 2 322 0 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 23400 

ACACCAAACG AGTGGATGAT ATCTTTGAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 23460 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 2 3 520 

TTAGTGAGTC CATAGGCTGC TAGTGTGGAC ATGAGTCCTG C G AC TACT AG TCCTGCAGAA 23580 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 2 3 640 

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 2 3700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 2 37 60 

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 2 3 820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 23880 

ACGCCCTTAG T AATG ACT TG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 2 3 940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 2 4 000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24 060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 2 4120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 2424 0 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 2430 0 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 243 60 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 2 4420 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT 2 4480 

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGGCAGTC 2 4 540 

ATGAGTTCAA AGATTTCCTC T T CCTTGT AT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 2 4 600 

AGCATACTTC CGACTGTTGG AAG AT CT ACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24 660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 2472 0 

TTTTGACCAA GAATGTAGTA CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24 84 0 
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AAGACATCTT TGTCAAACTG AACATCGTCC TTGTCTTCAT AGTAGAAGCC ATCGTTGAAG 24 9 00 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24960 

AATTTGCCAT AGAGTTGATA CCACTTGAGG TAGTACCAGC CTTGGGCACT AGTCGCATCG 2502 0 

TTGGCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 2 5 080 

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 2514 0 

GTTTTTGAAA CT T TAT C AAA GTGGGCTTGA GGATTTTTAA TACCTAATTT TTCCATGGCA 2 5200 

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG GG AT GAG AC C GATGTCCTTA 2 5260 

GATTTAGCGT ATTCAATTAG CTCTGTTACT TCTGCCTGTG TTAGTGCAGT ACCGTTTGGA 2 5320 

TCGTCGTAGT AAGCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 2 5380 

GTTTTTCCGT TGGCAGTAAT GGT C AT AT C A TCGAGTAGAA AGCGAAGTCC GTCATTTCCT 2 5440 

AGAAGGAGAT G G AC AT C AG A ATATCCGAGC TCACTGGCCT TGTCTACGAT GCGTTTGAGC 2 5500 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560 

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCGTGAG GGTCAAGTTG 2 5 620 

TTGACAGTTT CTTGAAGTTT AGCAATGGCT TGATCAATCG TATCTTGTTG GGCACGGCTA 2 5 680 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCGT GACGCTTTCT 2 574 0 

GCAGTATAAC GGTTCAGGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AG AT TC AT AA 2 5800 

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGAGTTT GAAGAGGCGT 2 5860 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 2 5 920 

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 2 5980 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG CCATCAGCGT TTCGTCCAAG AACGTAGTAC 2 6040 

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTTT AGAAGAAGCG 2 6100 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC ATAGAAACGA TGATGTCTTT GT C AAAACT A 2 6160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTGG TTTGAGACCG 2 6220 

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGGCAA TAAATTTTTC ATAGCCTTTT 2 62 80 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 2 6340 

TCTGTCGCAT CATTGGCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCGCG 2 6400 

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACAGCTTG TTCGTTGTCA 2 6460 

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 2 6520 

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 2 65 80 
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AG AC CG AT AC CTTTATCTTT GGCATAGTTA ATCAGATCTG TCATTTGACT TTCTGTTAAG 2 6640 

TGATTGCCGT TTGGATCGTT GTAATAATCA TTTGTACCTT TTTCAATGGC GCGTTTGACA 2 67 00 

TCGTCACTGG CATAGGTCTT GCCGTTAGCT GTGATGCTCA TATCGTCCAA CATGAAACGG 2 6760 

AGTCCATCAT TTCCGACTAA TAGGTGTAAA TCAGTGTAGC CATAATGTTT CGCTTTATCG 2 6 820 

ATGATTTCCT TGAGCTGTTC TGGTGAGAAA TATTTACGTC CAGCATCAAT AGAAACAATT 2 6880 

TTCTTTTTCG CTAGTTTTTC ATTTACAGTT GCAGCACGTT CCTTTCCTGC CTCTGTTGCC 2 6940 

GGTTTGTCAG CCTCTGCTTT CGCTTCATCT TTTTTAGCTG GTTTATCCTT GTCAGTCTTG 2 7000 

TCTGTATTTG ACTCTTTAGA ATCAACCTCT TTCGCTTCTT CCTTTTTAGG GCTAGCTTCT 27060 

TCTGCCTTTT TATTAGCAGT TTCTTTTTCA GCAGAAGTTG GAGTTACCAC TTCTGCTTTA 2712 0 

TCACTAGGAG TTGAACTAAC TTCCTCTTGT GGTTTTTCTT CTGTTTTTGG AAGACTAGCT 2718 0 

ACCTTATCAG TAGCTGGAGT TTCTGTTTCT ACAGTTTTTG GAGCTTCTGG TTGAAGCACT 2 7240 

GCTTTAGGTG TTTCCTCAGT CCGATTTTCG GATGATTGAG GGGAATCAGA AACCGTATGG 27300 

ATGGTCGGTT GGTTTTCTGT AGTAGTAGGA GTAACTCCAT CGGCTGCAAC AGTCTGTGCT 27 3 60 

TGGAAGGCAA ATCCAATTAG AACAGAAGCT GCTCCTACAG CGTATTTACG AATAGAAAAA 2742 0 

CGCTGTTGTT TTTCATGTTT CATTGCAAAA CCTCCTGATT GCATTGTTAT ATTGAT AG CG 2 74 80 

AT TAT AT AAA TCAACGCCTT TATTTTATTT CTTATATTAA TTTCTTATAT TAACGAGAGT 2 7 540 

CAAGAGGAGA TGACAAAAAA CTATAATAAG TATAAAAAAA TATAAAATTT AAACTTAAGA 27 600 

TTTCAGATTG GTCGGAAAAA ATACGTATAT ATATCTAGTA TAATTTTTGG TTCTATTTCT 27 660 

ATAAAATATT CCACAAATTA TAGAATTTTC CAAAAATAGG TAAGCGCTAC CTTTTTGGTG 2772 0 

TAGTATAATA AGCATAGAAA AAGCCCAAGC GATTAGCTCA GGTTTTCTTC TTAGTGATCA 27780 

CGGTCACATG AG AT AAAT TT AATCTTGTAG TAATCAGATC GTTTGTAAGT TTCACTGTAT 2 7 840 

TCTAAAACTT GGCCAGTTGA TTCGAGTTTG GTGATTTTAG TTTGTAGGAC AGTAGGGAAT 2 7 900 

TGTTCATCGA CTCCGAGGAC TGAAGCTGCA TGTTCTGGAG TTGGAAAGAC TATTTCGTTG 27960 

ATTTCTTCAA AGTGTTCATC ATTCATGTGA ATGTGGTAGT CTAACTTGAA ACGATTATAG 2 8020 

ATAGAACTAT AGTATTCAAG GTTTGGATAA TTTGCGTTGA TATATTGTTC TGGGATGTAG 2 8080 

GATGTATGGT AGATATAAAC GACACCGTTT GATTCGCGGA TACGTTCAAT CTTGTAGTAG 28140 

AATTGATCGC CGCGTAGACC CAATTTTTCC AAGTAAACAA GCTTGTTTCC GCGTTCAATT 2 8200 

GAAAGAACAG TTACCTTATC ATCTTTAGCA TTGAAGAGTT CAATATCTGA AAACTCTACA 282 60 

AGCTTGTGTT TGCGTGCACG TGAAACGAAG GTTCCTTTTC CTTGTTGGCG GACAATATAG 2 8320 

CCATCTTTGG CAAGGTCGTT TAAGGCGCGA AC AACTGT G A TAGAGCTGAC AT CGT AC AT T 2 8380 
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GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 2 8440 

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 2 8473 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCGC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 12 0 

AGAAAC T GAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 24 0 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 3 00 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAG CG C AAGC 3 60 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 42 0 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 4 80 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 54 0 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTGCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 6 60 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 7 80 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 84 0 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 9 60 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 102 0 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 12 6 0 
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TTTTGATGAA GGAAATCGCC CAATCGGCAA CTTCCTCTTT GTAGGGTCTA CTGGGGTTGG 132 0 

TAAGACGGAG CTTGCTAAGC AATTGGCACT CGATATGTTT GGAACCCAGG AT G C GAT TAT 13 8 0 

CCGTTTAGAT ATGTCTGAAT ACAGTGACCG CACAGCTGTT TCTAAGCTAA TTGGTACAAC 144 0 

AGCAGGCTAT GTGGGTTATG ATGACAATAG CAATACCTTA ACAGAACGTG TTCGTCGCAA 1500 

TCCATACTCT ATCATTCTCT TGGATGAAAT TGAAAAGGCT GACCCTCAAG TT AT T AC C C T 15 60 

TCTCCTCCAA GTTCTAGATG ATGGTCGTTT GACAGATGGT CAAGGAAATA CAGTAAACTT 162 0 

CAAGAACACT GTCATTATTG CGACCTCAAA TGCTGGATTT GGCTATGAAG CCAACTTGAC 1680 

AGAAGATGCG G AT AAAC C AG AATTG AT GG A CCGTTTGAAA CCCTTCTTCC GTCCAGAATT 174 0 

CCTCAACCGC TTTAATGCAG T CAT C G AGTT CTCACACTTG ACTAAGGAAG ACCTTTCTAA 18 00 

GAT T GT AG AT TTGATGTTGG CTGAAGTTAA CCAAACCTTG GCTAAGAAAG AC ATTG ACT T 18 60 

GGTAGTCAGT CAAGCGGCTA AAG AT TAT AT CACAGAAGAA GGTTACGACG AAGTCATGGG 192 0 

GGTTCGTCCT CTCCGTCGCG TGGTTGAACA AGAAATTCGT GATAAGGTGA CAGACTTCCA 1980 

CTTGGATCAT TTAGATGCTA AACATCTGGA AGCAGATATG GAAGATGGCG TTTTGGTTAT 2 040 

TCGTGAGAAA GTCTAAGACA GAATTTTGAG GATAAAAAAG AAGG AG C C AG CTGAAAAAAA 2100 

CTGGTTCCTT TTTAGGTACG ACAGGCATGT CGTATAGTAG AAGTGTATTA TTCTAGTTTC 2160 

AATATACTAT AGTAGCTCAG AAGTCGGTAC TTAAACGTGC TATATCAAAA CCAGTCCTGG 222 0 

AAAAACGTGG ACTGGTTTCG TGTTTGGATT ATTACCTTGA ACGACATGCG TTAAAAGTTA 2 2 80 

GTTGAACCGC CGTATGCCGA ATGGTACGTA CGGTGGTGTG AGAGGGGCTA GAG AT TAT C C 2 34 0 

CCTACTCGAT TTTAAATCAC ATGACGTTCA AAGGCATCAT CTGAAATCCC TTGTTCCAAG 2 400 

ATGAGTTTTG CCCATTCTTT AGCAGAGAAG AGGCTGTGGT CCTTGTAGTT T C CG C AAG AT 2 4 60 

TCGATGGTTG TCCCTGGGAC ATCTTCCCAA GTAGTAGTTT CAGCGATTTC CTTGAGCGAA 2 520 

TCCTTGATAA CAGCTGCGAT TTTAGCACTG GTGTGACGTC CCCACATAAT CATGTGGAAG 2 580 

CCTGTGCGGC AACCAAATGG TGAAC AGT C A ATCATGCCGT CAATGCGGGT ACGGATGAGT 2 64 0 

TTGGCTAAGA GGTGCTCGAT AGTGTGAAGG CCGGCAGTAG GG AT AG AGT C TTCGTTTGGT 2 7 00 

TGCACCAAGC GAATATCATA ATTGGAGATG ATGTCTCCTT TTGGTCCTGT TTCTTCCCCA 27 60 

ATCAAGCGAA CATAGGGTGC TTTGACAATG GTGTGGTCAA GTTCAAAACT TTCGACAATA 2820 

ACTTCTTTTG ACATGGTAAA TCCTTTCAGT TTTCTTCTCT CATTATATCA TAAAGGTTGC 2880 

T CCTG AG AC A GAGAGAAAAC CTCTCCGAGG CTGGAGAGGT TGAAATCTTT ACTTACGATA 294 0 

TAAGCGGTCG TATTGGTAGT ATGGGTCAAA GGTTACGTTG AT AC CC AGTT TACGAAGGAC 3 000 

ATTCTTGTCT TCATCAGTCA AGATGATGGT TGAGTGGGCT TCGCTTCCTT TGAGGTTGCC 3 060 
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GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 312 0 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AG AT G AT CG A TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 32 40 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3 3 00 

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3 3 60 

TTTTTGGCGC GCAACGACAG C AACCTT AC G GTCTGCAGGT GTGATACCGA GGTCGTTCAT 342 0 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3 600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3 660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TG AC AT CATC 3720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3 7 80 

AACAGGGAAG GTTTCAAATT T AGCGT AG C C AGATTTGATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3 900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960 

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4 080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 42 6 0 

ATAACCAGGA AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 43 2 0 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4380 

AT ATTGT T C A GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4 500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4 680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

AT GAT AATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA T C AG C T G ATT GCAGAAGCTA AG AAGCGT G A CATTCGTATC ATCATGGACT 4 860 

TGGTGGTCAA T CAT AC CT C A GATGAACATG CTTGGTTTGT CGAAGCCTGT GAAAATACTG 492 0 

ACAGCCCTGA GC GAG ACT AC TATATCTGGC GCGATGAACC CAATGACCTA GATTCTATCT 49 80 

TTAGTGGGTC TGCTTGGGAA TACGATGAAA AGTCAGGTCA ATACTATCTC CACTTTTTCA 504 0 

GCAAGAAACA GCCGGATCTC AACTGGGAAA ATGAAAAACT TCGCCAGAAA ATTTATGAGA 5100 

TGATGAACTT CTGGATTGAT AAAGGTATTG GTGGTTTCCG TATGGATGTT ATTGACATGA 5160 

TTGGCAAAAT TCCTGACGAG AAGGTAGTCA ATAATGGTCC TATGCTCCAT CCCTATCTCA 5220 

AGGAAATGAA TCAGGCGACC TTTGGAGATA AGGATCTCTT GACAGTAGGG GAGACTTGGG 52 80 

GAGCAACTCC AGAGATTGCC AAGTTCTACT CTGATCCAAA GGGGCAAGAA TTGTCTATGG 5340 

TCTTCCAGTT TGAACATATC GGTCTTCAGT ATCAGGAAGG TCAGCCTAAA TGGCACTATC 5400 

AAAAAGAGCT GAATATCGCT AAGTTAAAAG AAATCTTCAA CAAATGGCAG ACAGAGTTAG 54 60 

GAGTTGAGGA CGGCTGGAAT TCCCTCTTCT GGAACAACCA TGACCTCCCT CGTATTGTCT 5520 

CAATCTGGGG AAATGACCAA GAATACCGCG AAAAATCTGC CAAAGCCTTT GCAATCTTAC 5580 

TTCATCTCAT GAGAGGAACT CCTTATATCT ACCAAGGTGA GGAGATTGGG ATGACCAACT 5640 

ATCCGTTTGA AACACTGGAT CAAGTAGAAG AT AT TGAAT C TCTCAACTAT GCGCGTGAGG 5700 

CTCTTGAAAA AGGTGTTCCG ATTGAAGAAA TCATGGACAG TATCCGTGTT ATTGGACGTG 57 60 

ACAATGCCCG TACCCCTATG CAATGGGACG AG AG C AAAAA CGCTGGTTTC TCAACAGGTC 582 0 

AACCTTGGTT GGCGGTTAAT CCAAATTACG AGATGATCAA TGTCCAAGAA GCGCTGGCAA 588 0 

ATCCAGATTC TATTTTCTAT ACCTATCAGA AACTGGTCCA AATTCGCAAG GAGAATAGCT 5 940 

GGCTAGTTCG AGCTGACTTT GAATTGCTTG ATACGGCTGA TAAGGTCTTT GCTTATATAC 6000 

GTAAGGATGG CGACCGTCGC TTCCTAGTTG TGGCTAACTT GTCCAATGAA GAGCAAGACT 6060 

TGACAGTAGA AGGAAAAGTC AAATCTGTCT TGATTGAAAA CACTGCGGCT AAAGAAGTAC 612 0 

TTGAAAAACA GGTCTTGGCT CCATGGGATG CTTTCTGTGT GGAATTACTA TAAATATTTT 6180 

TTGCAGAAAA ATTTAAAATT GAAATCGTAT AAAAACAAGG GAGGACTGTA TAAAAGACAG 624 0 

AAATCCTTTG TTTTTTATAA CCAAAGTTTA TAAACTTTCA TTCTTGAAAT TCAATTAACT 6300 

TTACAAATTC CCACTATTAA GGAGAAAGAA GATGAACATA AAGAAG CGTG TCCTTAGTGC 63 60 

AGGCCTGACT TTTGCATCTG CTTTGCTTTT ACCCAAATCA TTCATACCTC TCTCAACTAG 642 0 

ATGTAACTTA CAAAACCCCT GACCTCATGA GCCACTTTCT TCCTCCTCAT GAGGTCAGTT 6480 

TTACTTTCTG CTGTTCCAGT ATCGTTTTTC CTCGCTAGAT TTCCTCAAAA GGGCAGACTC 6540 

CTCCCTTGGT GCGTCACACG ATTTTTTCAT CTCGACTGTT CTTTAATGCA TCATTAACGA 6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6 660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 672 0 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 674 9 



(2) INFORMATION FOR SEQ ID NO : 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 12 0 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 180 

ATATTGCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 2 40 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA GACTTGGATA AACTCGCTTA 3 00 

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 3 60 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 42 0 

TCAAAACAAT CTGAAGAATA GCT CAT CATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 4 80 

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 54 0 

CTTTGTCTAT TTTTACTTAG AT AG AG AT C A AT C ATGG G AG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCGGCTCG ATAGCCTTGT CCATATAAAA CAAAAACGAG ATTTTGATGA 7 80 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 84 0 

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TAGTTATGTT TTTTATGGTA AT C AT CT AAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 9 60 

CAAATTTGAA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 102 0 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC AC CT T AAC C T CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 114 0 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 12 00 
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TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 12 60 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 1320 

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 13 8 0 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 144 0 

TTTCATCTCG ACTGTTCTTT AAT GC AT CAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1500 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 15 60 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC 1680 

T C C AG CT AG A TGTTTGAGAA AAAGATAGAG ATTGTAGGCG AT AC AG CT C A TCATCATACG 174 0 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 184 2 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAG C CAT TAT AACAGAATTT TGTGAAAATT 60 

CCT AT TAT AG T AAATC AC T A TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 12 0 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC AG AT TG AG C A 180 

CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA C T GAT AT C AC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 3 00 

TT TC AAT AT C CACTAAAATG GTCAAATAAT AGCGCTCGTT C AT TTTGT AG AGTTCAGAAG 3 60 

TTTCCATTTC AAAAGT CACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 42 0 

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480 

GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAG AT TG AAT GGCTTCGATA TCATCCTTGG 54 0 

TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC AT CTGG AG AC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 

ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 72 0 

CTGT AT CC AG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 7 80 
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CTGTTTTTTC TTGAGGAACG AG AAAGT C AG CAATCTCCAT TCCACGATCC ATCAAATCCT 84 0 

CTAAAGATAT CGTGATTTTT AAAGT T GT AT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 9 60 

TCCT C TAT AC GG AT AG ATT T TCCCTAGGGT CTTTCTATAG GAGACTCCAA AAGAAAATTT 102 0 

C TGC AG AC AG ATAGAAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT AC ACC G AG G A TAGAACGATT TAAGTTTCTG 114 0 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 12 00 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AAT C T AC AAA ATGCATTCCT 12 6 0 

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATG 13 2 0 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTG CTAGGTTAGC AATTGTTGTA 13 80 

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CT AC T C T AT T GATTGTTGGA 144 0 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC AC C ATT T G CG 15 60 

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT T C AC ATT T G C ATAGGCATGT 162 0 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 1740 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTG 18 00 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC ATTCAAATTA 18 60 

CCATCAATCA ATTTCCCATA AAAAT C ACT A TAACCACCAA TTTGATGATT CAAAATCGTT 192 0 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 19 80 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 2 04 0 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 216 0 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AG AATTG C AT 22 2 0 

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 2280 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 234 0 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2400 

GGTATACAAC TCCATGATAT AACTACTTCC GAT AC CAT G C CCTTTCAAGT ATTCCTTGTT 2 460 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA T C AAT AG C T A GTTCCACACT 2 52 0 
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ATTGGTTGTA TGTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2700 

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 27 60 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 282 0 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA ATAGTATTAG CTGCAATAAA 2 940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3 000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3 0 60 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 312 0 

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC G AAAT CAT G A CAACTATAAA 3 3 00 

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 33 60 

GGTCGGTCTT GAT AC C AG AA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3 42 0 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 34 80 

AAGTAAGCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3 600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3 660 

AATATTTATG AATAGGGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 37 2 0 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 37 80 

TTTATACTGA T C AT CT AG C A ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3 84 0 

GAAAAAAGCG ACCTCTTTCA AG TC AGGAT A GTTTTTCACA ACTTCATTAT AAAACTTTTG 3 900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3 960 

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4 02 0 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG T C AT CCTG AT TGGCTGACTG 4200 

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 42 60 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4320 
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AATAAAAATC 


AGATAATCCC 


CGTGAGCCTG 


CTTCATCCCA 


TCATTTCGTG 


CTTGCGACAA 


4380 


TCCTTCGTTC 


TTTTTATGAA 


GCACTGACAC 


CCTGTCATCT 


TGTTCAGCGA 


TTGAATCACA 


4440 


CAAGCGACCA 


CTTTCATCTG 


TTGCACCATC 


ATCAACAAGA 


ATAATTTCCA 


GATTTTGATA 


4500 


GGTCTGCTTC 


TGAATGGAAG 


CTATCGATTT 


TTCTAGGTAC 


TGCGCCACAT 


TATAGACTGG 


4560 


CACAATCACA 


CTAATTAATG 


CAGTTTCCAT 


GCTACTCCTC 


TAATAGTTTT 


TCTACTTGTT 


4620 


CGATTTGTTT 


TGTAATTGTA 


AATTGTTGAA 


TGAATTGGCT 


AGCCTCATCG 


ACATCAAAGT 


4680 


TTGAGGCAGA 


AGTCATGTAA 


TTAGTAATCG 


CCTGAGCTGC 


CTCTTGATTG 


CTCTCAATGA 


4740 


TTTGTCCAAA 


TCGTCCTTCT 


TGGGATAATT 


CCTCAGCCCC 


TCCAACGTCC 


G TAG AG AT AA 


4800 


AAGGGAGTCC 


CAGACTCAAG 


GCCTCCACAT 


ACACTCCAGG 


AAAACCTTCT 


TGT TT AG AC A 


4860 


TAGACAAAAG 


AACTTTCGTC 


T GAG AT AG AT 


ACTGATAAGG 


ATTTTTTTGA 


TAACCAAGGA 


4920 


AATGTACATA 


GTCCTCAATC 


CCATACTCTT 


TGACTCGTTT 


TTTCAGTTCC 


TCTTCCATAT 


4980 


CACCAGCCCC 


GATAAAATAG 


AGATGATAGT 


TTTTTCCCTC 


TTGGTGTAAT 


AATCGTATCA 


5040 


CTTCCACTAC 


ACGGTCAGAA 


CCCTTATTTT 


CCTCAATCCG 


TCCGATAGTA 


CAGATACTTT 


5100 


GAGGAGCAAT 


CTCGATATCG 


ATCTTCTCTT 


GAGATTTTTC 


TAGAATAGTC 


TG AAAAT CAT 


5160 


ATCCATTGTA 


GATTGTCTGT 


AATTTAGAAG 


TATAATCTGG 


ATAAACTTCC 


TTGATAGAAT 


5220 


TGCTGGTCTT 


TTTTGAAATC 


CCTACAATTG 


TATTCGCAGC 


ATCCAACTGG 


CTTCTATGTG 


5280 


ATTCTCTTTT 


AGAGCTATCC 


TTAAGAAGTT 


CTTCAATACT 


TCCATGAATC 


CAAGATATCT 


5340 


TCTTGACTTC 


TCTTCTTTTA 


GAGAACAACA 


GTGGTGGATT 


CATAATGGTA 


AAAGAAACTT 


5400 


CAACATCATA 


ATCATCTTTT 


ACAAGCAAAC 


GACGAGTCAG 


TCTTGGAAAA 


TAAATTCTCA 


5460 


TTCTCCACAA 


AAAAGCTCGT 


AACCATCTGG 


TTTGGCGATA 


ATCTTGAAGG 


GATTTTAAAA 


5520 


TGCGTACATG 


CTTTGGAACA 


GATTCATATC 


CCTTGTCAAA 


GTGCTCCATT 


TCAAGAATAT 


5580 


CAATATCATA 


CTTTTCTGGA 


TCCAGATTTG 


AAACAATGGT 


TGATAGAATC 


TTCTCTGCAC 


5640 


CACCTCCAAG 


AGAAAAAGAC 


CACATAAAAA 


ATAAGATTTT 


TTTCTTAGCC 


ACCATATTCT 


5700 


CCCTTGTATT 


CTGTATAAGA 


CTTATCCATA 


TCAGCGATGA 


CAGCATCATG 


ATGCGGTACC 


5760 


TGCTTGTCTG 


CTGGTGGAGG 


CGTCATATAA 


TCCCCAAAAG 


CAGTTCTGAG 


ATAGACATCA 


5820 


TAGCCGATTG 


GAATAGGCAT 


CTCTGTTCCT 


TCAAATGGCA 


AGAAAAGATT 


GTCTTCAAAA 


5880 


GATGTGATTG 


GGTACTTGTT 


TCTCATGTAG 


CCAGGACCTG 


AGCATAATTC 


TGTAATGCCA 


5940 


TCACAATCAG 


C C AAATC AT A 


CTTAGTCATT 


TCTTTCTCAG 


CTTTTTTCCA 


GATGCGATAA 


6000 


CGGAGAGATT 


TTGGAGTCAA 


ACCCAGTAAA 


ATGCGACTTC 


CCCATTTCAT 


GAGATCACCA 


6060 
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TGCTTTTCTG 


GAATAGTTTG 


CGCACAAAAG 


AGTGAATAAA 


TCAAGGCCCA 


ACGAACCTGT 


6120 


TTTTTCCGCT 


CAGCTGGATT 


TTTCGGATAA 


TAATCCAAAG 


GCAAAACATC 


CAAGGCCAGA 


6180 


CCATGTGGCA 


AATCCAAATC 


CTGCTGATAA 


GGCTTGATAC 


AGGTGGTTTT 


CTTGTCACGA 


6240 


ATGGTAATAA 


AAAG AT T AC G 


ATCAACAAAA 


TCCTTGTGAC 


TCTTTGACAA 


GAAATAACGT 


6300 


TCATCTGCAT 


AACGAGGCCA 


TAATTCTGCT 


AATTTCTCAT 


AATCTTTACG 


AGGCATAAAA 


6360 


AAGTCTAGGT 


CGTCGTCCCA 


AGGAATAAAT 


CCCTTGTTTC 


G AAGGGC AC C 


AATAGCGCCT 


6420 


CCGCCACAGA 


GATAACAGAG 


CAAATCATGT 


TCTTTACAAA 


AGGCCACAAA 


ATATTCAGCC 


6480 


ATCTCCAGAC 


TACGAGCCTG 


AATTGCTTTT 


AAATCAGTCA 


TATTGTTCAT 


TATTCTTTCT 


6540 


ATCGTATCGT 


TTCATTATAC 


CACAAACAAG 


GGGTGAAAAT 


CTATTGCAGA 


CTGTAAAAAA 


6600 


TCAAAGCCTG 


ACTGCTATCC 


AAATAGCTAT 


CAAACTTTGA 


TTTTTCTGTC 


TTATACTCTT 


6660 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCA 


CCTTGCCGTA 


GGTATAGGTA 


ACTGACTTCG 


6720 


TCAGTCTTAT 


CTACAACCTC 


AAAACTGTGT 


TTTTAGCAGC 


CTGCGGCTAG 


CTTCCTAGTT 


6780 


TGCACTTTGA 


TTTTCATTGA 


GTATTATCTT 


ATCTTAAGCC 


CATTTGAGCG 


AGCTTGGTTT 


6840 


G AT ATT TG TT 


TTGATCAACC 


AGCAGGCCCA 


AGCCCCCATA 


AACATCATAG 


GCATCTACCC 


6900 


AGTCACCCAG 


TTCTGGAATC 


GTCAATTTTT 


CAATACCATT 


TTTTGCTCCA 


TCCAAAACAG 


6960 


ATAAACCGTT 


TGTTAGGAGG 


AAAGTATAGG 


GTACGTTGGT 


TGAGGTCATA 


GCAAAAACCT 


7020 


TTCCAAGAGC 


TTCAGAACCA 


GTGAAAAGTT 


TAGTGGGATC 


TTTAATTTGC 


TCTAAAATTG 


7080 


CTGTTAAAAC 


TTGTTGCTGT 


CTTTTTGTAC 


GGCCGTAATC 


TGC C T CAT C A 


T CATC ACGG A 


7140 


AACGAGCATA 


ATTGAGCAGG 


GTCGAGCCAT 


TCATCTGCTG 


TTTTCCGACT 


TTAATGGTTT 


7200 


GGGTTGGAGA 


CTCAGTCTCG 


GTAGCGTATA 


AATCATCTCC 


GACTGTAGCT 


TCTGTTAGGG 


7260 


GACGCCCATT 


CAATGTTGAA 


AATTGAGCAT 


CAATCGTCAC 


CCCATCAGGG 


AAAAGCGTGT 


7320 


CAATCGCTGT 


GGCAAAGGCC 


TGGAAATCAA 


CCAAGGCGTA 


GTACTTAATG 


TCCAAGTCAA 


7380 


AATTATCTTT 


CAAGACTTGG 


CG AAC C AT TT 


CTGCCCCTTT 


TTGCCCCTCT 


TGTTCTCCTA 


7440 


ACTCGTAGGC 


TACGTTTAAC 


TTGTTATCTG 


TCTGTTTTCT 


ACCATTAATC 


ACTTGACTAT 


7500 


AACCATCTAT 


AT AG AC C AAA 


TTATCACGCA 


TGAAACTGAC 


TAGCTTCATT 


TTCTTATCTG 


7560 


AGCCCCCGAC 


ATTTAATACC 


ATAATAGAGT 


CAGTTCGTGT 


CTCAACACTG 


TTCTGGCCGA 


7620 


TTCGACCATC 


AGTACCCATG 


ATTAAAATAT 


T AACT C CAT C 


TCTAGTGTCC 


TG AC C ATT AA 


7680 


AGACTTCTAC 


TTGAGCTGCC 


CGGGCATCAG 


CAGTTTTCTT 


TGCGCTAGCA 


TCTTGGTAAC 


7740 


CACGCAAAAA 


C ATGAAT AC C 


ATGGCCAAAG 


CCACACAGAC 


CAAAAGTGAA 


AAAATCACCA 


7800 


TAAAAATTCG 


TTTAAGACGG 


AGCTTCCGTC 


TTTTCTTTTT 


TGGAGGGAAA 


GAGAGTGCTT 


7860 
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GTGATTTGGA TTGTGAGCGA CTCCGGTTCG CATAGCTTGG TAAGTCAACC TGCTCTTCTC 7 92 0 

TTTCTTGTTC CAAGCTAGAG C T AC TAT TT C CCCTAGCAAG AG T TAG C TTT TCTTGCAAAT 7 980 

AGGCAAACTC ATTTTTTTCT CTCTCATTGA GATAGTGAAT ATTTTTTAGC AAATAATCAT 804 0 

AACGCAACTG CTCATGATGA CTTAAGGGAT TTTCTTTACT CATCTTCTCT CCTTTCCATG 8100 

GTCTGATATT GGATAAATAG GATAGGCACC CAGAATTTTA TACTGGATTC CAATCGCTTC 8160 

TAATTCTTTT TGGGCAAAGT GGACCAAGTC CTTATCGGTA TAATCCACAT CGATAATGAA 822 0 

AAAGTATTCA CCCAGTGCTG TCTTGAGTGG ACAACTTTCA ATTTTTGTCA AGTCAATTCC 82 80 

TCGCCAAGCA AAGGTCGACA GGGCCTTATA AAGTGCACCT GGAAGGTTGT CAGGTAATGT 834 0 

C AAGGC C AAA CTCATCTTTT CAGTTTGTGC TTGCAAGGGA ATACTAGGCT TTTCAGCTCC 8400 

TAGAACCCAG AAACGTGTGA AATTGGCTTC CATTTCCTGA ATATCCTCGG CAATCAGTTC 84 60 

CAATCCATAT TCTTCAGCAG AACTTCTAGG TGCAACTGCT GCAAAGGGCT GGTCTGGATG 8 52 0 

TTCGGAAATA AAACGGGCCG CATAAGCTGT ACTAGCTGTT ACCTCGATTT GAGCCTCTGG 8580 

ATATTGTTCA TCGATGAATT TCTTTCCTTG AG C C AAGGC C TGTGGATGTG AAAAAATCTT 864 0 

TTCAATCTTA GTATGGCCTG GAACCACCAT CAACTGCTGA TGAATAGGCT GAACGATTTC 87 00 

TGCTACTGCT TGGATGTGAG CCTGATGAAA AAGATAGTCC AAGG T T T CAT GAACACTACC 87 60 

CTCAATAGAA TTTTCAACTG G C AC C AC AG A ATAGTTCACT AATCCTTGCT CATAAGCCTT 882 0 

GATGACATCT GTAATGTTGG CAAAAGCCTG CAATTCCTCA TGAGGAAAAG CTGTCTGCAC 888 0 

AACGTGGTGT G AAAATG AT C CCTTGGGACC TAGATAAGCA ATTTTCATCT TAGTTCCTCT 8 94 0 

ATAATTTCCT CTGGGCTTAG CTTGGTCACA TCCAAAACCC GACTAGCCAC TTCCTCATAC 9000 

CAAGCCTGTC TTTCTTGGAA AATAGCTACT AGTTCTTCCT TGCTATTATT TAGAAAAAGC 9 06 0 

GGTCGCTGAT TGTCCTTATC AGCTGCGATA CGTTGGTAGA GGGTTTCAAA ATCTGCTCTC 912 0 

AGGTAGATGT TATCTGTATT AGTCTTGAGT AAGTCACGAT TTCTCTGAGA AATAACCACT 9180 

CCTCCTCCAG TTGACACGAC TTGGTCTGTT TGTAGTAAAT CAGCTAGGAC TTCTGATTCT 924 0 

ACCTGACGAA AGGCTGTTTC TCCCTTTTCA GCGAAAAAAT TCGCAATGGA CATACCTAGG 9300 

CGATTCTCAA TCAGAGCATC CATATCAAGG TAATTAGGGT CCAAGCCTCT TGCAATAGTC 93 60 

GATTTTCCAG CCCCCATAAA CCCTAATAAC ACCTTAGCCA TGAATCAAGC TCTCCAAATC 942 0 

ATCAAAGAAA CTAGGATAGC TGGTATTGAT GGCTTCTGCA CGGTCAAGCT CCACCTCTCC 948 0 

ATCTGCAACC AAGAGGGCTG CGATAGCTGT CATCATGCCG ATACGGTGGT CACCAAACGT 9540 

ATTGACTCTA GCACCGTGAA GAGCTGATTT TCCTTTGATA ATCATCCCAT CTGCCGTAGG 9600 
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AGTAATATCT GCTCCCATAC TATTTAAGGC GTCTGCCACA ACCTGAATAC GGTCTGTTTC 9 660 

CTTGACCTTG AGCTCCTCAG CATCCTTGAT AACTGTTACA CCTTGGGCTT GGGTCGCAAG 9 72 0 

CAGGGCAATA ATGGGCAATT CATCAATCAA TCGTGGAATC AAAGCGCCAC CAATCTCTGT 9780 

TCCTTTCAAG TCAGAAGACT CAACAATCAA GGTAGCAGAT TTAGCGACTG GATCGATTTC 9840 

AGTTATTTCC AATTTTCCAC CCATGGCACG AATGACATCA ATAATACCGG TGCGAGTTTC 9900 

GTTGATCCCC ACATTCTGCA GCACTAGACG AGAATTTGGA GCAATCAAAC CTGCGACTAA 9960 

CCAAAAGGCT GCACTGGAAA TATCTCCTGG TACGACCACC TTCTGTCCTG TCAATTTTTG 10020 

TGGCCCCTGG ACTGTGATTT TCTTACCATC CACACTTAAA TGACCACCAA ATTGTTTCAA 10080 

CATATCTTCA GTATGATTAC GGGTGTACTC TTTTTCGATA ATAACTGACT CCCCCTTAGC 10140 

TTGTAAGGCT GCAAACATCA AGGCTGACTT GACTTGGGCA GAGGCAATTG GCAACTCATA 10200 

ATGAATAGGT CTTAGGTTTT TCGTCCCTTT TAAGCGAAGG GGAGGCAAGT CTCGTTCAGT 102 60 

TTGCCCTGAA ATGCTGACGC CCATTTTTTT CAGTGGAAGG GTCACACGGT CCATAGGACG 1032 0 

TTTGGAAAGA CTATCATCTC CAAACATCTC TACTTCGAAA TCTGCACCAG CAAGGACACC 103 80 

TGAAATCAGG CGAATCGAGG TGCCAGAATT TCCCATATTA AGGGCATTTT GTGGCGCTTT 10440 

TAAGCCAGCC ATGCCTACAC CTTGAATGGT AATAACCCCA TCTTTATCCT CAATTTCAAC 10 500 

ACCAAGGTCA CGAAAAACCT GCATGGTCGA AAGAACGTCT TCACCTCGCA GAATATCATA 10 5 60 

AACCTTGGTC TCACCCTCAG CCAAACTTCC AAAGATAATG GAACGGTGGC TG AT AG AC T T 10 620 

GTCACCTGGG ACGCGGATAC TACCATGTAA ATGGCGAATG TTTGTTTTTA GTTTCATACT 10 680 

GGACCTCATA CTTGCAATAC TTTTACCTAT TTTATCATAA AAAGCCAGAA ATTCCTTAAA 107 40 

AATTCCTGAC TTTAGGATCG TTCTTTTCTT ATTTCAGCAA TTCTGAAACT GGTTCAAAAA 10800 

CAATTTTTTC AATATCAGAA AGGTAAATGG CCAATTGTTG TTGCTTGGTA AAGAATTCTG 10860 

ACAAGAGGCT ATTTCCTTGA ATCTGTTTAC CAAAGCCTTC CAT C T T AG CT TGGAAGGACG 10920 

CATCTGGCAT TTGACCTGTC TGTGCTAGTT TTTGAATTTC CTCTTGAAAG G C AAG AT AAT 10 980 

CTGTAAAGAT TTTGCTTGCC TCAGCATCTG CTGCAATCGC ATCTTTAGCT GCTTTAACAG 11040 

CCTTGTATTC TGGTAATCCG CGTAGACCGC GACTGAGTTC GTTTGCACTA TCGTAAATAT 11100 

TTGACATGTT CTTCTCCTTA TTTGATGACG ACTGTATAGT CAGTATTTTC TGTTATGAGA 11160 

TGCTCAGCTC TTTCCAAGTC TTGAGCATTT TTAAATGAAA TTTGTAGGAT TCCGTGAATA 1122 0 

TCCTCACGAT TTTCCTCGTT GATGTGGATA TTAACCAAGG AAGTTCCACG TAGCAGTTCC 112 80 

AAAATCCGCA GGATGACATC TTCTTCATCA GGAACGTCAA CATAGAGGTC GTAAGAGCTA 11340 

TCCACACCAC CACGCTTATG GATTTCCATG GTCTGGCGTT GTTCACGCGC TTGGTTAAAA 11400 
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AAGTTCCAAA TTTGCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCGC TTCCAAACGT 114 6 0 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGG GTCATATCTC GAAAACCACC TGCCGCAAAG 11580 

CGCCTTGCCA TCTCATGCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAAT AT GAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 117 60 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGGCATTTT CAAAAAGATT GACATCTGCC 1182 0 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 118 80 

TTGCCAGCCA AATACTGCTC CGCCGCATCC AC AAT GGTTG ACTTGGTCGA ACCAGCATCT 1194 0 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATGAA AGCAATAGTT 12 000 

TGTTTGATTG G C AAGCTG AG GATAATGACA TCTGCCAAAG GAGCAAAACT AGC AAAAT C A 12 0 60 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 1212 0 

TTATAACCTA AAATTTCATA AT C T GG ATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 1218 0 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 1224 0 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12 300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT GACCATTCCT 123 60 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCGTGGGTT 12 42 0 

T CG AT AT CCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGGTTTCAT GACCCCACGA 12480 

ACAACGATGG GTTGCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12 540 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12 600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12 660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 1272 0 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC ACGTTTGATT 127 80 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT T G AC AAT AG A AACTTCAGAC 12 840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12 9 00 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12 960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 1302 0 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13 080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 1314 0 
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TCCAGCCATT 


TCTGGTGGTC 


CTTATTGATG 


ACATCCATAG 


TAATAGGCGC 


CCCTGTCGTC 


13200 


TTCCCGTGGC 


GAACGCCCGA 


AGTAAAGACA 


ACCTGGTCAT 


TCTCAATCTT 


CATACGACCA 


13260 


CCACGACCGT 


AGCCACCCTG 


ACGGCGTCTA 


AGGTCCTCAT 


TG AT AT C CT C 


AGCTGTCAAT 


13320 


GG AAGTCCAG 


CTGGAATTCC 


CTCAATAATA 


GCTGTTAGAC 


GGGGGCCGTG 


TGATTCTCCT 


13380 


GCAGTTAAAT 


ATCTCATACA 


CTCTCCTTAT 


TTTACCAAGT 


AGTCTTTCAT 


CTCTTCCAGA 


13440 


GAAACTGGGT 


GAATGGTCGC 


TGAACCAAGC 


TCTGGCACCA 


AG AC C AATT T 


CAAGGTGTTA 


13500 


CCACGCGCTT 


TCTTGTCATG 


AGTAAGAGCC 


TGATAAAGCT 


TGCCAACTTC 


CCAATTTTCA 


13560 


TAGTCAACAG 


GCAAACCGAA 


TTTCTGACAC 


ATCTCTGTGA 


TAGATTGGGT 


AATGCCAGCT 


13620 


GGCATGAGGC 


CTTTTTCCTC 


AGCAACCTTG 


GAAATCTGTA 


CCATTCCCAT 


GGCAACAGCC 


13680 


TCTCCATGCA 


TGACCTTGCC 


ATAACCGGCA 


GTCGCTTCGA 


TGGCATGGCC 


AATAGTGTGG 


13740 


CCAAAATTGA 


GGTAAAGACG 


AATACCATTG 


TCCAACTCAT 


CTTCAACCAC 


CATCTTGCGC 


13800 


TTCACCTGAC 


AAGAATGTTC 


AATCAAGGTC 


TCTGCATGTT 


C C AAAAT ACT 


CTCAACAGAA 


13860 


CCATTCAGTC 


CCGTCAAGAG 


AGCCCACAGT 


TCTGGATCCT 


CAATCAAGCC 


ATACTTGATA 


13920 


ACTTCACCCA 


TCCCTTCAAT 


CAACTCTCTT 


TTTCCGAGGG 


TTTCAAGAAC 


AAGTGGATCA 


13980 


ATCAGAACCC 


CATCTGGTTG 


GGCAAAGGTC 


CCCACCATAT 


TTTTAGCAAA 


TGGTGTATTA 


14040 


ACGCCTGTCT 


TTCCACCGAT 


AGAAGAATCA 


ACCTGAGCTG 


TCAAACTAGT 


CGGAATCTGA 


14100 


AC AAAGT G AA 


TACCCCGCAT 


ATAGGTAGAG 


GCTACAAATC 


CAGCCAGGTC 


CCCAACAACG 


14160 


CCACCACCAA 


GAGCAACGAT 


TCCATCGCTA 


CGAGTCAGAC 


CTTGCTTGAC 


TAGAAATTCA 


14220 


TAGACTTTCT 


GAACAGTAGT 


TAAATTCTTT 


CTTTCTTCAC 


CTTCTAAGAA 


AT C AAAAAC A 


14280 


GCTACCTGAA 


AACCAGCATC 


TTCTAGGCTG 


AGCTTGACCT 


TCTCTGCATA 


GAGAGAGGCT 


14340 


ACATGGTTAT 


CTGTCACAAT 


GACTACCTTT 


TGCGGTTGCC 


AGAGTTCTCG 


CAACCACTGA 


14400 


CCAGCCTGGG 


CCATACAACC 


TTTTTCAATC 


TGAATATCAT 


AAGGATGGTG 


AGGAATATCG 


14460 


ATTCTGATTT 


TCATAGGAGA 


GTCTCCCTTT 


CTTTATTGGT 


ATTTTTCTGT 


TAAAGACTGC 


14520 


CAAATCTCTT 


CTGTCGGCAT 


TTCCTTGCCT 


GTCCACAGTT 


GAAAAGCTTC 


TGCAGCTTGA 


14580 


TAGAGTAACA 


TTCCCAGACC 


ATTGACTGCT 


GGATTGCCCT 


GACTTCTAGC 


CCATTTCAAA 


14640 


AACGGTGTTT 


CAAAGGGTTG 


GTATATGATA 


TCTGCAACTA 


AAAGAGTTTC 


TGGTAAGACT 


14700 


ATGTTTTCAG 


GAACAGGAGA 


GGATTGGCCA 


TCCATGCCCA 


C AC TGGTGG C 


AT T AACT AG C 


14760 


AAATC CG ACT 


CGGCAATCCT 


TGCTTGCAGT 


T C AG AAAC AT 


ATTCTAAAGC 


AC AC AAATC C 


14820 


ACTTTAAAAC 


CTGTCTGCTC 


CTGTAACTTG 


TCTAGGTAAG 


GTCTTGTTTT 


TTCCATAGAA 


14880 


ACGGAACGAA 


CAAAG AC CG A 


AATCTGACTG 


ACGCCATCCA 


AAATAGCCTG 


TGCCAAGATT 


14940 
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GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGC AAG C ACT TAAAAAATCC CTTGCCATCT GT AT TAT AT C CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATGC CAAACATCTG GTAGCGACGA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA CAACTGCAGC TAAACGTGTA TAGCCATCAA GCT T CAT CCA AAATCTCCCT 15360 

GATTTTTTTC ATGCTAGCTA GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATCAAGTCT AAGAC AT C CT GCTCCGTGTG AGCCATCACC GCAACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CAGCATTTCC TCAAAAACAT CCTTGTAGCT 1572 0 

AT AGT AC TC A AAATCAATAT AGTCTGGTTG ATAGAGTTGC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCGAGTTC GTAGCGTGAA 1584 0 

AACCAACTCA CGGCCTGCGA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCGCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 159 60 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16 020 

ACCTT C AT AC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 1614 0 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 162 00 

CCTCCCGGAT TTAAAATCTC AAGACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 16260 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CTAGGCGGAT CTAGGACAAT CACATCGTAG 16320 

GTCAAGTCTT TGCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 16380 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 1644 0 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAG C C AAG C C GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATT CAT AAAG 16 620 

ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 16680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 168 00 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17 040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC C ATT AT AT C A AACTTTTAGA CAATTCTCAA AAAAGAAACT 1722 0 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 172 80 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCG TCTTGGAGGA GCATTTCTTT 17 340 

TTATCAACGA AAATCAAATA GCAAACTATG AAAC TAG C CT CAGGTTAACT GTGAGATTAT 174 00 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 174 60 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17 52 0 

GCCTTTTGTT ACAAACTCAA TAT ACT AT C A ATAAATAATA TTATAGAAGC AACAATAATT 17 5 80 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATGTTC T AT C AAAAAT 17 640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA T T G AAAAAT A TCCAAATAAA 177 00 

TAAACTATAA ACAACAAAAA TAGAACTATG TT AT ATT T C T TATTCAAAAC ATTCCTCCCT 177 60 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CT ATAAAAG C 17 82 0 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17 880 

CCAATATGCG TGACAGGTAA T AATGAT AG C CAAAAATAGC AAGAGCAAGC AAGACGATAA 17 94 0 

GAGCTCCTAC TCCCAAGCTG AT GGC AAGG A TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGG C CAT C AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGAC f A 180 60 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA. 1812 0 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 1818 0 

GGT AAAT CAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGACCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GG ACT GG AC A GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 183 00 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 183 60 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GT AAAAC CAT GCGAAAAAAT CCCTTTTTAG 18 600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 186 60 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 1872 0 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 187 80 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18 840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18 9 60 

AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAG C TAG C CT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 192 00 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 19260 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT AT AT TACT AA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 193 80 

TTACTTTTTT 193 9 0 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGG 6 0 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 12 0 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 24 0 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 3 00 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 3 60 

TTCGCGATAA AAACTGGATC CGTCACCCAT AC C AG AT CGG TGCTTTGATG GACAAGGAGG 420 
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GTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC AGCAGACTCT ACTGACCACT 480 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGCGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC GTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 66 0 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 7 80 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 84 0 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TGCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 9 60 

TGGTCAAGGA ACTC ACT AT C AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 102 0 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 10 80 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 114 0 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 12 00 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 12 60 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 13 2 0 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 13 80 

TTAAATCTGG AAACCCAACT GGAACTTACT ATG AAT TAG A TTATGGTCGC AAAACTAACT 144 0 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTGAGAAAAC CAT C C AAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 15 6 0 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 162 0 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 174 0 

GAAT AC AT CG AAATGTTGGA AGCCAATCAG GTGGACGGCA TCATTTCTGG TAGTCACAAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 1860 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 192 0 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC AT C ATG ATT A CAGGGAATGA CAATTCTAAT 1980 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA T AT CTTG AC C 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 222 0 
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ACCTACTTTA TCGAAAATTA CTACCCTCAA TTGGCTACTA TCAAGCAACC TTTGGAAGAG 22 8 0 

ATTGCTTGTC TCACTATTGA TCTTCTCTTG CAAAAGATTG AAGGCAAGGA AGTCGCCACA 2 3 40 

ACTGGTTACT TCTTACCAGT TACGCTATTA CCAGGAAAAA GTATTTAAAC ACAAGAAAAC 24 00 

TCAGACCGAT TCGTCTGAGT TTTTATGATC TTAAATTTTC G AG AT AG CG C TGGGCTGTCT 24 6 0 

CTAGGTTAAA GGTTTTATCT GAGATGAGGC GCTCTACTAG GGGAGCAACT T C AG AT T C AC 2 520 

TAGCCCCAGC TAGGAGAGCT AGGGATTTGG CCTGTAGTTT CATGTGGCCT TGCTGGATGC 2580 

CCGTACTTAC CAAGGCTTTG AGGGCTGCAA AATTTTGAGC AAGACCGATG GACACGATAA 2 64 0 

TCTGGGCTAA TTCTCTGGCA GAAGGATTTC CTAGTAGATC ATGACTGAGA ACTACACGTG 2 700 

GGTTGAGGCC G AT AG AGC C A CCCTTAGTCG CTACAGGCAT GGGCAGGGTC ATCTCACCGA 27 60 

CCAATTCTTC TCTTTCAAGG TCCAGCGTCC AGCAGCTAAG ACCTTGATAG CGTCCATCTC 2 820 

GACTGGCAAA GGCATGGGCC CCAGCTTCGA TGGCACGCCA GTCATTACCA GTGGCAATCA 2880 

AAATCGCATC AAT AC C ATT A AAAATTCCTT TATTATGAGT AGCAGCTCGG T AAGG AT C AG 2 94 0 

CCTGCGCAAA CTGACTAGCC AACGCAATTT TCTCCGCAAT CTCTCGTCCT TGATCCTTTT 3 0 00 

GGCGGCTCAA GTAGCGAAAG GCGATGCGAC AGCTTGCAGT CACCAGAGAA TCGGTCGCGT 3 060 

AGTTGGACAG GATTCCCATG AGACTCTGTC CCTGACTGAG TTCTTCTAAG ACTGGTTTCA 312 0 

AGGCTTCCAG CATGGTGTTG AGC AT ATT GG CACCCATGGC TTCCTGGGTA TCGACATGAA 318 0 

TATAAACAAC GAGAAAGTCT GGTTCGCCTT TTATCTGCTC GACATGCAGA TCACGCGCCC 3 2 40 

CACCTCCACG TTTAACGATA GAAGGATAGG CTTGATTGGC AAGCTCCAAG AGCTCCGCTT 3 3 00 

TCTTGCTGGC AATCTTCTCT TGCGCTAGTT T AGG ATT AG C AACTTGATAA AGGGCTACCT 3 3 60 

GCCCAATCAT CTGTCGCTGA TGGACTTGTG CAGTAAAACC ACCTGCACGC TTGATGATTT 342 0 

TGCTGGCATA GCTGGCCGCC GCAACCACAG AGGGTTCTTC TGTCACATAG GGAACGGTGT 3 4 80 

ATTCCTGACC GTTGACAAGT ACCTCCGGAA CCAGTGAATA AGGCAGAGAA AAAGTTCCCA 3 540 

CT AC ATT CT C ACTCAGCTGG TCTGCCACAG TCACGCTCAT CTGTTCATCC TTCTCCAGAC 3 600 

TAGCTTGTCT CTC AGG ACTA AGGAGCGCCT GAG CTTTT AA CAGCTCGAGG CGCTCTTGGT 3 6 60 

ATGATTTTTT AGAAAATCCA TTCCAACTTA TCTTCATTAT TTTTCAACCT TGCTATAACG 3720 

GCGTTGGTGG TCGAGAATTT CAACCAAGGC AAAATCTTGA TTTTCATAGC C AG C AAACTG 3 7 80 

GGCAGAGTTA GTTTCATCCA AGTTTACTTC CTCAAAAAAG ACCTTTTCAT AGTCTGCAAC 384 0 

GGATAGGGCA GTTCGTTGGT TGAGCTTGTT CAAACGGTCT TTATCCAAAT AAGC T T CAT A 3900 

TCCTTCAACC AATTCACCAC TGAAGAACTC AGCCACAGCT CCACTTCCGT AACTATAAAG 3960 
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GGCGATTTTA TCCCCAGCTT TCAAGCTATC TGTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTGAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TTTTTTTCAA 4140 

GCCTTTTAGC GCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 4200 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 42 60 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 432 0 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG G C CATC AT GC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 45 60 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 4 620 

GTCAATTCCT GACTCGGTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTAGCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AG AC AT AT T G ACTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTTTATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 492 0 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4 980 

T C AT C ATT T T AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC AGTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCCCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 516 0 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 52 2 0 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC AGTAGATTTA 52 80 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 534 0 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 54 00 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GGTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 5520 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

AT T T T AC C AG CTTGTCCGAT ATTTCTGCAA CTCATTTTGA ACAACTTCAT AT C ATGACT A 5640 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC 57 00 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AG CTG AG AGG AGTTCTTGAA 582 0 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

CTTTCGTGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC T C GC AC AT AT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GAGATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 612 0 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTGCCAAA ATGGCCAAGA TAAAGACCAC AG C AGGTGT C TTATAAAGAA TACTTAAAAT 6240 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 63 00 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 63 60 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 642 0 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 6480 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6 600 

GCATGACCGC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 672 0 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 67 80 

CAAGTGCGTC GT AG AT AT T T CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 684 0 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTGCC CCCCAACTAG CTGACGAGAA ATCTGGTTCA 6 960 

CAT CGC AG AC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7 02 0 

GGTATTTTCA ATAGAGAAAA AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7 080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 714 0 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT C AC AT C AATG ACGGCATTTA ATTCTTTTGA 7 2 00 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 72 60 

AAAAAATCTT TGCCATGAAA T C ATG AC AAA GATTGATTAC TCATTTTGAT TATCCATCTG 7 32 0 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 73 80 

ATTGAGGTCT ACCTTTTCAC CTGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7 500 
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TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 75 6 0 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CAT C AT AAGT 7 62 0 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7 74 0 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTAGCTTTTT TAGCTACTGT 7 800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7 860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7 92 0 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7 9 80 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 822 0 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 82 8 0 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 83 40 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 8460 

ACTTTCATTT CTACGATTGT AGGACAAGCT ACTAGACAAG TCCACATACA TCTCTGAAAC 852 0 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 85 80 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATG CTGG AG 864 0 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 87 00 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 87 6 0 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTGTCTCTGC 882 0 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8 880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC G CAT AG C C AA 894 0 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC C AGC C AC AT A CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 912 0 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 930 0 
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CATTCTGCAT CCTCCTCGTT CATAGAAACA AGGGCAACCG TAATGTTGTC TAAACCTCCT 93 60 

GCATTGTTAG CAAAACGAAC AAGTGTCTCC GTTTTATCTG CTAAAGGAAT ATCACTGGTT 942 0 

ACAATATCAC GAATCTCACT GCCTGAAATC ATGTTGGTCA AGCCGTCACT ATTGAGCAAG 9480 

AGATAGTCAC CTGACTCAAG GATAACTGTC CCAAAATCAG GCTGAATTTC ATCTTTTTGC 9540 

CCAATAGACT GGGTGATAAT ATTTTTTTGC GGATGAGCTT CTGCCTCTTC TGGTGTCAAT 9600 

TG AC C AG C C T TGAGCAATTC ATTAACCAAG GAATGATCGC TCGTCAACTG ATGGTATTCT 9660 

TCTCCACGAA TCAAGCCGAT ACGCGAATCA CCAATATGAG CATAGATAGC CTG AT T AT C A 9720 

ATAATAGCAA GGACTTCCAA AGTAGTTCCC ATGCCTCTGT AAGCTTCATC CTGACCAAGC 97 80 

TGGTGAATCT TTTGATTTTC AATTTCTAGG TAATGGGCGA ACCATTCACG CACTTCATTG 9840 

AC TGT ATCG A TCTGGGTATC AACCCAAGCT AC AC C C AGGT CTGTGACCGC CATTTCACTA 9900 

GC G AT AT T CC CTGCGCGATG ACCTCCCATC CCATCAGCTA AAATAATCAT GGTACGTCCA 9960 

GCTCTATTGA CATAGTGGTT GACATAGTCT TGGTTATTTG TTCGTTTCTG AC C AAC AT C T 1002 0 

GTTAATAATG AAATTTCCAT GTGTCAGTTC CTTCCTAATC C G AT AT CT T G CGAAATTGAC 10080 

TGATGAAGAA TCCATCACTT CCATACAATT CAGGTGTAAT GAGGATACAG CCGTCTTTCA 10140 

TGATATCCTT ACATTCATGT TCTAGTTTTA CCTGCTCGAA CTCGGGATGA CTCTCTAAAA 10200 

AGGCCTTAAC GACTTGAAAA TTCTCCTCTG AG AC G AT AGT GCAGGTGCTA TAAGTTATTA 102 6 0 

T AC C ACCTTT GCCTAGTATT TGACAAACAC TACCTAATAT TTCTAACTGA ATTTCCTGCA 1032 0 

AGGACGCGAA ATCTGCCGTT TCTTTATTGT ATT TG AT AT C TGGTTTTCGG CGCAAAAGAC 103 8 0 

CGATTCCTGA ACAAGGAGCA TCCACCAAAA TCTTATCAAA GGAATCCTGG TCAAAAAACT 10440 

C AT GC AC C TT TCTGGCATCC AATTTTTGAG TTTGAACCCG ATCTTCAACT CCCAGACGTT 10500 

GGGCATTTTC TTGAATTAAA TCCAACTTGT GGTCGTACAA GTCCAGAGCA GTAACCTGAC 105 60 

CTGTCGTAAG ATAAGAGGCT ATATGGGCTG TTTTCCCACC TGGAGCCGCA CAGGCATCAA 10620 

GCACTCGCTC ATCACCTTGT AAATCAAGCG TCGGAGCAAC CAGCTGACTG GACTCATCTT 10680 

GGATGGTAAT GGCTCCATCC GCAAACAAAT TATGCCCTGC AAAATGCCCC TGCTCCTTAA 1074 0 

CCAGACCAGT GGTTGCTAAA AGGGAATTAT TCGCCTCCAA CAAGGCTTGG ATTTCCTCTT 10800 

TTCGACTTAG GTCTGTTACA CGAATACTGG CTTTGTTTCG CACTAACAGG CTTTCAAAGA 10860 

TGGCTTTTGC TCTCTCCTCT CCGTATTCTT CCTTGAGTTT GGCAACTAGC CAAACTGGGA 1092 0 

GAGAATAGGC AATGGAGTCA CGCTTGTTTT TTCGCTTGAT GCTAGCAATA TCTGGCCAGC 10980 

CTTCACGCAA GATACGGCGA AGGACAGCGT TGACCAATTT TTCACTGCCT TTTTTACGGA 11040 
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GTTTGGCCAA TTCCACTGCT TCATTAACCA C AG CAT GAT C TGGAATCTTG TCCAAATAGC 11100 

GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 11160 

CT T CG AT AAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 1122 0 

CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA C AAAAAC AT C CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 11400 

ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 114 60 

AGGGATAGAG CCCCTTCAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AG AG AG AAT C 11520 

TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT AAAT CT T AAA GCGGTCGCCC 11580 

TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 11640 

CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 117 60 

TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA AT C AT ATCT C CTGCATCCAT TTCCTTAACC 11880 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGATAGGC 11940 

GCACCACCAC GGTGTCTAGG AAGGAGGGAG GCATGAACGT TGACAGCAAA GTCCATGCTA 12 000 

TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12 0 60 

AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGAT/vATT TTTCAGGTTG GTAGATAGAT 12120 

AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGGTTTCTT GGATAACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 12 2 40 

CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA AGATTAGTTT TGTCATATCT 12 300 

TCTCCTTCTT AT AAAAATT G CTGCGGCTCA TGGTCAATGC TGAGACGGAG CT C AC TAT T T 12 3 60 

TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 1242 0 

CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTGGG TACGGGCAAT CGGTTTTGGC 12480 

GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12 600 

TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 12660 

TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 1272 0 

ACTTGACCTG CCTTTTCAGC C CG AC CTG AT CGACCTGCCA CCTGAGTCAA GAGCTGGAAG 12780 

GTTCTCTCAG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG TATCCGCATT TAGAACTCCG 12 840 
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ACTAGGGTAA CATTGGGAAA ATCCAAACCC TTTGCAATCA TCTGAGTACC AAGTAAAATA 12900 

TCCGCTTCCC CTCGCCCAAA CTGGTCAAGC AAGGCTTGGT GACTGCCTTT CTTTCGAGTC 12 9 60 

GTATCCACAT CCATCCTCAG AATGCGAGCT TGGGGAAAGA GTTCTGCTAG CTCATCATAA 13 020 

GCCTTCTGAG TTCCCGTCCC ATAGTAACGA ATACTGCGGC TCTTACAGTT AGG AC AG AC C 13080 

TGAGGAATAT CCTTCGAGAA ACCACAATAA TGGCAGTTCA TAGTCTTGGT ATCCATATGC 13140 

AAGGTCAGAG AAATATCGCA GTTGGGACAA GTATCCACCG TCCCACACTC CCGACACATG 13 200 

ACAAAGCTAG AATAACCACG GCGATTGAGC ATGAGAACCA CCTGCTCTTT TTTAACCAGA 132 60 

CGGTCTTGGA TAGCCTCTAG CAAAGGAGGC GTAAAGTTTG ACGTCTCATT TTGTCCGATA 13 320 

TAGTCTCGAA AGTCAATCAC TTGAACCTCA GGGATTGTAG CCAAAGGATT GGCACGTTGG 133 80 

GT T AG AC GT A AGTGTTGATA GACGCCTTTG CCAGCACGTG CCCGGCTCTC TAAGCTCGGC 13440 

GTTGCAGATC CAAGTACCAG AGTTGCTTGA TTATACTGAG CCCGTAAAAT AGCTACCTCT 13 500 

CTGGCATGGT AACGGGGATT GCTGTCCTGC TTATAAGCCG CTTCATGCTC TTCATCAATA 13 5 60 

ATCATGACAC CCAGATTTTT CAGAGGAGCA AAG AT AG C AG ATCTGGCACC AACAACAACT 13 620 

TGGGCATCGC CACGCTCCAC CTTGCGCCAT T CAT CAT ACT TTTCACCATT GGATAATCCT 13 680 

GAGTGAAGAA TGGCTACCTT GTCCCCAAAA CGTGCTATAA AACGCTCGGT CATCTGAGGA 13740 

GT C AAGG AAA TCTCAGGTAC CAGCAAAATA GCTGTCTTGC CCTTATCCAG GGCACCTTGG 13 800 

ATAATCTGCA AGTAAACCTC GGTCTTCCCA CTTCCTGTAA TCCCTTGAAG TAGAAAGGGA 13 860 

GGTTGAGAAC TGCCAATAGA ACTCACAACC GCATCACGCG CCTGTCTTTG TTCTGGATTT 13 920 

AACTCCAAAG GTCTACTTGC TTCAATTCCT TCAAAATAAG CAGCCGAGCG TTGAACTTCC 13 980 

TTTTGGACTA TGGTAACAGC ACCTTGATCC ACAAAGAAGT TGACTTGCTC TCGCGAGTAG 14 04 0 

GACTCTAACA AGCTAGCCAA GGAAGCGCTC TCTGGATGAG ACAGCAGATA ATCTCTCAGT 14100 

TCCAACTTTT TCTTGGCACG TGTAGAAATC TCAACACCTT CTAATTGAGC ATGGTCAACC 14160 

T C AT ACC AAG ACTGGGTCTT GACCTTCTTT TGATCGACTG CCTGATATTC CAGACCAAGC 1422 0 

AGGCCTTTTC TAGTCAAACG CATCATTTCA GCTTGCTTGG CAAGGTCTAG TGAAGAAAAG 14 2 80 

GCTAGCGAAT CTTCTGAACC AAACAGGCGC ACTCGTTCTT CCTGACTCAA GCCTTCCAGA 14 3 40 

GGATAGAGAA TCTTGTCATA GCTAGAATTC AGAAACCCTG GAAGCATGGC CTTGAGGATA 14 4 00 

GAGATTTTGT AG GAG AAG AC AGATTTGCGT AACTCCTCAG CCAGCCAGAG TTGTTCTGGC 144 60 

GTGAGAACAG GAGAAAAATC CAGCACCTCT GCAATATCTT TTAAATCTTG CTCCATCTCT 14 52 0 

TCTCCATCTG ATTGGGACTT CAAACCAAGA ACAATCCCTT GAATCAGGCG ATTACCCTTA 145 80 



WO 98/18931 



PCT/US97/19588 



702 



CCAAAAGGCA 


CATGAACCCG 


CATCCCAACT 


TCCAGCATTC 


CCTCAAATTC 


CTCCGGAATC 


14640 


CTGTAACTAT 


AGGGCTGGTC 


CGTCTGCATC 


AAGGGCACAT 


CTACGATAAT 


CTTAGCTAGG 


14700 


GCCATCTTCT 


CACCTCCTCC 


TTGTCAGTAC 


ATTCTTGCAA 


TAGAAAAAAT 


AAGATTGAGT 


14760 


CCCCCCAACC 


TTAAATTTTT 


TCACCATCTT 


CTTTTTCTTT 


AGCAATTTGC 


TCTTTGATTT 


14820 


TCTTTTCTTC 


TTCTTCTTTG 


CGGCGTTTTT 


CTTCTTCGAT 


ACGGCGACGC 


ACTGCTTCAC 


14880 


GTTTTCCTTC 


TGGATCTGGG 


TGAATTGTAA 


CGTTTCCTGA 


TTCGATTTCT 


TCTAAAGCGC 


14940 


GAAGAGTTGA 


TTTTTCAGAC 


TTGAAACCTT 


GAGTTGCTGG 


GGCACCTGCT 


TCCAATTCGT 


15000 


GGGCACGTTT 


TGCTTCCAAG 


ATTACGAGTG 


AATATTTTGA 


AGGAACCTTG 


TCGAGCAAGG 


15060 


TATCAATAGA 


GGGTTTTAAC 


ATCATTTGCT 


TGTACCTATT 


TTCTAAATTT 


TATCGGGTAG 


15120 


TTGGAGATTT 


TGGTAACATC 


TCCTGATAGT 


GACCAATGAC 


ACGATCCACA 


CAGAAGTGTT 


15180 


CTGCTTCAAT 


C AC AC AT T TG 


AC ACGT T C AG 


CAGCTAGGGG 


TACCTGATCG 


TTGACAATCG 


15240 


CATAATCATA 


CTCACGCATG 


AGGGCAATTT 


CTTCCTTGGC 


CTTTTCGATT 


CGTTGGGCAA 


15300 


TCACTTCTGC 


ACTATCTGTT 


CCACGACCTA 


CCAAGCGATC 


TTGCAATTCA 


TCCAAATCTG 


15360 


GTGGTGTCAG 


GAAGATAAAG 


ACAGCATCTG 


GAACCTTTTT 


CTTGACCTGA 


AG AG C AC C C T 


15420 


GAACTTCAAT 


TTCAAGGAAA 


ACATCGATTC 


CCTTGTCCAA 


GGTTTCATTG 


ACATAGGTCA 


15480 


GAGGAGTTCC 


ATAGTAGTTA 


CCG AC AT AT T 


CTGCGTATTC 


CAACATCTGT 


CCTTGACGAA 


15540 


TCAGCTCTTC 


AAATTCTTCA 


CGAGTACGGA 


AGAAATAGTC 


AACACCGTCC 


ACTTCTCCAG 


15600 


GACGTTGTGC 


GCGTGTCGTC 


ATCGATACAG 


AATATTGAAA 


TTGGTTTTCA 


GAACTCTCAA 


15660 


AAATCTCTCT 


TCTAACCGTT 


CCTTTTCCAA 


CCCCTGAAGG 


ACCAGAAAAA 


ACGATTAGTA 


15720 


AGCCTCGGTC 


TGCCATTGTG 


TCTCCTTTTA 


GTCAATCTGT 


GAAATAACAT 


TTCTCTAGAA 


15780 


TAATGGCAAA 


AAGCCAGATT 


ATCCTTTACA 


GTCTTTCTAT 


CTAGTGTAAC 


AAAAAAGCAG 


15840 


TAATTTTTCA 


ACTGCTCTTT 


CTTATTTATT 


TAGCATAATC 


T AC TG C AC G A 


AGCTCGCGAA 


15900 


TCACGGTTAC 


CTTGATATTT 


CCTGGATAAT 


CGAGATTGTT 


TTCAATTTTC 


TTACGAACTT 


15960 


TGTGAGCCAA 


GATTGTGACT 


TTGTCGTCCT 


TGATTTTTCC 


TGGATTGACC 


ATGATACGAA 


16020 


TTTCACGTCC 


TGCTTGAAGG 


GCAAAGCTAG 


TTTGCACTCC 


TTCAAAGCCG 


TTAGCAATTT 


16080 


CTTCCAAATC 


ATGGAGACGC 


TTGATGTAGC 


TTTCAAGAGA 


CTCACTACGA 


GCACCTGGAC 


16140 


GGGCTGCGCT 


CAAGGCATCT 


GCTGCAGCGA 


CGATAACTGC 


TATCACGCTC 


TCAGCTTCAA 


16200 


CATCTCCGTG 


GTGACTAGCA 


ATCGTATTCA 


CCACAACTGG 


GGGTTCCTTG 


TACTTACGGG 


16260 


CCAATTCCAT 


ACCGATTTCA 


ACGTGGCTAC 


CTTCAACCTC 


ATGGTCAATG 


GCTTTCCCGA 


16320 


TATCGTGAAG 


GAATCCAGCA 


CGACGGGCAA 


GAGCCGCATT 


TTCACCAAGT 


TCGCTCGCCA 


16380 
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TGATACCAGC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 16440 

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT GGCGCACCAA 16500 

TTTCATAGGC AGCAGCCTCA CCGTATTCAC GAATCTTATT GTCAATCTCT TGACGGTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16 620 

TAGTCATACG GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC ACCACTTCTG 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGG T AC G A ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTG 16800 

TTGACTCCGC TACATATTCA CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860 

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCAGCTTC GCGAATGCGA CTGGCAATCT 16 920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AG AT AAT AT C TCGTGCTTCT GCCTGAGACA 16980 

GCGCACCAAT ACGCTCTAGT TCTGCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGGTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGGCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 1722 0 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 172 80 

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTC AGCTTGTCCA CGTAAATTAG 17 340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 174 00 

TAGCTGAGAT G CTG AC AT AT CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 174 60 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT T ATT C CG AAT GACATACATT 17 520 

CTTTTACATT C T AC CAT AAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAG 17 580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17 64 0 

CTTGAGAAAA AGCCTACCTT T C AAT AG AC T TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17 700 

ACGCTATCTC CATCCATCAT ATAAATCAAG CGATTTTCTG CATCAATACG CCGT G AC C AG 177 60 

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 17 82 0 

TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG 178 80 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 17 940 

ACATGGACCT GAGTACTTCC AGCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18 000 

AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTGGATAC TATCCCACTC ACTCTTTGAA 1806 0 

AGGACTACAA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 1812 0 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 18180 

AT AAC C AT AC CTCGTTTTAG CTCTTTTCCA C TAT TAT AC A CGAAAAGAAA GAAATTGTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTT AT ACT C AATGAAAATC AAAGAGCAAA 18300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 183 60 

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTAGACT 18420 

TAGTTTGTTT CTATTC 18436 
(2) INFORMATION FOR SEQ ID NO: 88: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



ACGTAGAAAA 


ACTATTTCTA 


TCACAGATAA 


TATTCCGTAT 


GTTGTTGGAG 


GTATTGAAAT 


60 


AAACGTCCTA 


GGTATCTTTC 


TCAGTCTATG 


TGACTTACAA 


GGGAAAACTC 


TTTTCGAGAC 


120 


AGAAATTTTG 


AATGAAGATT 


ATCCTATTTC 


AGAAATCAAT 


TCCACCATTA 


CCAATATGAT 


180 


AAAAACAGCT 


ATAGAGTACG 


TCCCTTTGGA 


AACAAAATTA 


CTTGGATTTG 


GCTTATCAAT 


240 


ACCTGGACAT 


TATAACAAAG 


ACTCCGGAAG 


TATCATTACA 


AACAACCCCA 


TATGGGAATC 


300 


TTTTAATTTA 


TTAAATGTAA 


TTAAAAGATT 


CAATTTTCCT 


TTTATTGTAA 


AAAATAATAT 


360 


CGATTGTATG 


GCTATAGGAC 


AATACCTTTT 


TAATCCACAC 


AATACCCCCG 


ATAACTTTAT 


420 


TTTCCTACAC 


GCTGGATTAG 


GTATTTACAC 


TTCCTTTTTC 


ACAAAAGAAA 


AAATAGGAGC 


480 


CTCTAAAAAT 


CCTTATATCG 


GAG AAATT GG 


ACACACCATT 


GTCGAATTGA 


ATGGGCAATA 


540 


TTGTGAATGC 


GGAAAAAAAG 


GTTGTTTACA 


AACATATATT 


TCGGATGCTT 


GGTTAATCAA 


600 


ACACGCCCAA 


TTATTATTTA 


AAAATTCCCA 


ACTAACTGTA 


CTAAAAAGCC 


TTGTAAAGAC 


660 


T G AAAAAG AC 


ATTCATTTAG 


ACACCCTTTT 


AACGGCTTAT 


AATTTAGGCG 


ACTCCGCTTT 


720 


ACGTCAACAA 


AT TG AT AAAG 


GAGTCAATTT 


ATTAGCCACT 


TCTATTGCAA 


ATCTCCTCCT 


780 


CATCAATCCT 


GCTGATAAAA 


TCTATATCAA 


CAGTCAATTG 


CTTAATTATC 


AACCTTTCAC 


840 


TCATGAAGTC 


AGGGATAAAA 


TCCAAGACCA 


GCTCCACTTC 


GTTCCCTTTA 


CTCGTAATAT 


900 


AGAAATTGAA 


ATTTTACCTT 


ACAACAAACA 


TCGTGGAAGT 


ATAGGAGCTT 


GTGCATTAGC 


960 


TATCGTCGCT 


TTTTTCATAG 


AACATAGCAA 


TGTATTACAA 


GATATTATTT 


CACCTTAATA 


1020 


TATTAGAAAT 


CTATAGACCT 


GTTTAAATCA 


ACTATAACCT 


GTAGTAGATA 


TCTCGTATTT 


1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 1140 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT AT T AT AAAAA CATTCATCAT 12 00 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 12 60 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 13 2 0 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 13 80 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 144 0 

AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GG CAT CAT AA CTCTAAAAAG 15 60 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 162 0 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 174 0 

TAAAGTTACA AAGATGTACG AC C AC AC TGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 

T GAT AG CAT T TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 18 60 

GTGCTTTGGC ATTCCTAGCC TT CAT AT CAT TTAAAGAATT CTATAGACAA AATTTTTTCC 192 0 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 19 80 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT C T AAG AC AAA TAAGCTAAAA 2 04 0 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA AT C AAGC AAA AACTCGACGA TAGTACAAAC AT TAT CAT AC 22 2 0 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTCAT 228 0 

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2 3 40 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAAC T AC AT 2400 

TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 24 60 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 2 52 0 

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2 580 

TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2 640 

G AGCAT ACT C CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2 7 00 

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 2 7 60 

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 282 0 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2 880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2 940 

CCCATATGAC CTATAATGAA AAGCGACAAA AC AAC T C ATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT AT C AC AAAAT TACTAGACAT TAAAGACCCT AAT AT C C AG A T T TT AG AC AT 3 0 60 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG G ACT AC G ACG CCCCATCTTG 312 0 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3 240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 33 00 

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT 33 60 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3 540 

GAT AT CAT C A CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3 600 

TATGATAGAG CCGTCCGATG TCGCGTCAAA ATTATTACTA TGGATATGTT TAGTCCTTAC 3 660 

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CT T T C AC AT T 372 0 

GT AC AAC AT C TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3 7 80 

CG AAAAT C C C ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA AC AGG AT AG C 3 840 

CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AAC C AAT AAA 3 900 

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3 960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG G AAC CGG AG A AATTTTTCGA ACTTATCGAG 4020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AAT AATCT C A TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 42 60 

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 43 2 0 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 43 80 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGC AAC C CTCGCAACTG CAGCTGTTTG 4500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTC AAAAT TC AC T AAT AAGC GTTTATTAAA 4560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4 62 0 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 4 680 

CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 474 0 

G AC ATGGCT T GGCTCTGCAC GGTGCCCAGC AATC AC TG AG CGAATCCCTG TACCATCCAG 4800 

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4860 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AG AC TGC AT C 492 0 

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4 980 

GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG 5100 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 522 0 

CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA G AG AAGG AT G ACCGCCATCG 52 80 

CTCCTAGTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 53 40 

CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 54 00 

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 54 60 

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 5 52 0 

AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAATTCTG 55 80 

CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 5 64 0 

GCGAAGCCCC TTCTAACAGC CCTGCCCCTT TCTGAAGAAT GTCCTCACTC GT T C C G AC AT 57 00 

ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTT C C AC AT A GCCAATCCGC TCATGGACCT 57 60 

TTAGCATATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 582 0 

GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5 8 80 

GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5 940 

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000 

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACGTC 60 60 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 612 0 

TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 62 40 

AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATGG 6300 

GT AT TG AT CA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 63 60 
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AAT AAAT CAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GAT C C AC T TG ATAGGCCATC 642 0 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGT AATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6 600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 672 0 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 67 8 0 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG G AT ACGC C AT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 69 60 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GAGGGAGCTT AAGAAGTTAC CACCGTCCTC TAGCGCCTTA TCCGCATCAA AGTTAAGGTT 6 0 

GATATTTTTA AAACTGTCGC CAGCTTGTGA TACGATGCTT TGTTTAAGGT CATTTAGGGT 12 0 

T T T AGTG AAA TCTGCATTGC TGAGGATATC ACTCTTTGAG AGATTCAAGG CAAAATTGAT 180 

GATGATATTG ATCTGGTTTC CTGTTATGAC CTGATCAAGT TTGTAATTTT TTAAGGTATC 24 0 

TTCAACAATC T TGCGG AT AT CTTCTTCTGT CAGATTTCCC TTACTTTCTT TAGCTTTGGC 3 00 

GAGTCCTGAC TTG AT AT C AG CTAGGGCAAC GTTTAATTTA T TAG CAT CAT AGCCTGATTT 3 60 

GTCCTTGTTT TCAGCATTGA TAT C TG AC AA AGCTTTTAGC TCTTCTTGAG CCAAATCTTT 42 0 

ATTAGCTTGT GGCACCTTGG CTCCATTAGC CTCTAGCGAA TAGTAAATCC CTGCTAAAGC 4 80 

ACTTTCTCCT GTAACTGGAA TAGGGGCTGC TACAGTGATT TTGGCATGTT CCATACCCAG 540 

CGTTACTGCT GCGTTTCGGT ACATATCCTG AGTCACCTTA GTGATATTTT CTGGTGTTTC 600 

AATCTTGACC TCAAGTGGCG ATTTGTC AC C TAGCTTTTGA ATCTTGGCTG ATGAATACAA 660 

CTGTAAGCTA GAGTCATTGG C C AC ATT CAT GATTTTAGAA TAAACATCAG GTGTCATGGT 72 0 

CTTGAGTTCT TTGGTATCTG TTGAGGCATT GTAGCCCAGT TTTTTAAGAG TTTGATTTTT 7 80 
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TTGGTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TGGACATAGG TTTCATCGAT 84 0 

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCT ATT CAT A GCTGTTACTG CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 9 60 

AGTAATATAT C T AT CT T AAA GAAAACTTAT AACAAAAACA CCTGGTCTAG CCAGATGTTG 102 0 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT 108 0 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGGG TGAATATAAC 114 0 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAG T ACTGT T 12 00 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAAT C GAT A TAGGCAGGTC 12 60 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GC C AG AT AG A 132 0 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 13 8 0 

TGGTGGAACC AC T TG C AG AC T C ACT AG AC T GC AT AAC AT C GCCTTCTTTT CCTTTTGTTT 1440 

C AG TAT AAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 162 0 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAAGATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC T C AAC AG ACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 174 0 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TG AAC AGG T T CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTGTTGTCAA AAAGGTCATT AGCCAATTTC TCAGGTGACA ATTCATTGCT 192 0 

TTCTTCTAGG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TG AAAT TG AA AATCATCTGT 19 8 0 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT T C CTTG AT AG ATTTTTTAGG 2 04 0 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTGCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2 220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 22 80 

TTCTACACCT TCTTTAGAAA ATT GC AC AAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2 340 

AATGCTAAAC TCCTCTTTCC AG AG AT T AGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2460 

ATACACATGT TCAATTTTTT TACGCAGGTA TTCTTCGATT TTTGGAGTAA TATTGAGAAA 2 52 0 
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CTTATCTGCT 


AAGAACAGTT 


CGGT AT CAT C 


CGGACTGAAC 


TGGTGAATAA 


TGGCTTTCTT 


2580 


AATATAAATG 


TCCATAAAAG 


TTTTAGTCCT 


CGTATAATGG 


GAAGGCATCT 


GTCAATTCTT 


2640 


TGACTGCACT 


TCTCACTTCT 


TCTAATACAG 


CCTCATTTTC 


TGAATTCTTA 


AGGGTTTTAA 


2700 


TGATGAGTTC 


AGCCACTTTG 


CGACTTTCTT 


C T TC AC C AAA 


TCCACGTGCA 


GTAATGGCTG 


2760 


CTGCTCCGAT 


ACGAATCCCA 


CTTGTCTTGA 


ATGGTGACAA 


GCTTTCGTAA 


GGGATTGAGT 


2820 


TTTTATTTAA 


GGTAATATTG 


ACTTCATCCA 


ACAAGTTTTG 


AGCAACTTTG 


CCGTTTTCTA 


2880 


CAACTTTAGT 


CACATCAACA 


AGGAAGAGAT 


GGTTTTCAGT 


TCCACCTGAA 


ATAATACGGA 


2940 


AATCAGGGTC 


TTGCAAGAAG 


ACATCTGCCA 


TAGCCTTGCT 


GTTCTTAATT 


AC AT TGGC AG 


3000 


CATATTCCTT 


GAAGGCTGGA 


TCCAAAACTT 


CTTTGAAGGA 


AACTGCCTTA 


GCCGCCACAA 


3060 


CATGCTCTAA 


AGGACCGCCC 


TGAATACCTG 


GGAAAATAGC 


TGAATTGATT 


TTTTTAGCAA 


3120 


GTTCTTCGTC 


ATTGGTCAAA 


ATCAAACCAC 


CACGAGGTCC 


ACGAAGGGTT 


TTGTGGGTCG 


3180 


TTGTTGTTGT 


GATATGAGCG 


TATGGAACTG 


GGCTTGGATG 


AAGGCCAGCC 


GCAACCAAGC 


3240 


CAGCGATATG 


GGCCATGTCC 


ACCATGAGCT 


TCGCACCGAC 


AGCATCTGCG 


ATTTCACGGA 


3300 


ATTTTGAAAA 


ATCGATAATT 


TGAGAATAGG 


CTGAAGCACC 


AGCTACAATC 


AGTTTTGGTT 


3360 


TTACTTCTTG 


GGCTTGTTTC 


AAGATAGCAT 


CAAAGTCTAA 


GAGTTCCGTT 


TTAGGATCAA 


3420 


CACTATAAGA 


AACAAAGTTG 


TAGGTTTGAC 


CAGAGAAGCT 


AACAGGAGCC 


C CAT G AGT C A 


3480 


AATG AC C AC C 


TGATGCCAAA 


TCCATTCCCA 


TAACCGTATC 


ACCTGGCTCA 


ATCAAGGACA 


3540 


TGTAAGCCGC 


ACAGTTAGCT 


TGGCTTCCTG 


AATGTGGTTG 


AACATTGGCA 


AATTTAGCAC 


3600 


CGAAAAT TT C 


TTTTGCGCGT 


TCAATAGCAA 


GAGTCTCTAC 


AACGTCTACT 


ACATCAGTTC 


3660 


CACCATAATA 


ACGGCGTCCT 


GGGTAACCCT 


CGGCATATTT 


ATTTGTCAAG 


AT AG AC C C T T 


3720 


GAGCTGCCAT 


AACAGCCTTG 


GAAACTACGT 


TTTCCGAAGC 


AATTAACTCG 


AT ATT AT TT T 


3780 


GTTGGCGTTC 


TTCTTCTTTG 


GCAATAGCAT 


TCCAGAGATC 


AGC AT C AT AT 


GCTTTAAAAT 


3840 


CATCTTTGTC 


AAAAAT CAT A 


GGTCTTCTCC 


TTTATTGTGT 


GACTAGTCCA 


TTAGTTTGAT 


3900 


TTTACAATAA 


GAAAATCAAA 


CTAACAGATG 


CGAAT AAAC C 


GTTTCTGCAT 


T TT AT C AC AA 


3960 


GTATAGCCAA 


CTTTTTCATA 


AAATGC AT G A 


GCACCCAGAC 


GATGATTGGC 


AGAATTTAAG 


4020 


CGGAT AAAC C 


CATAACCACA 


TCTTTTTGCT 


TCTTCTTCCA 


ACCCTTGTAG 


TAAACTTTTA 


4080 


CCAATACCTT 


GACCTTGCGC 


TTGAGGTGAA 


ACTGCTAAAG 


CTAAGATATT 


AAATCCTGCT 


4140 


TTGGAATAGA 


GTGATTCGTA 


AACTTCAGCG 


TGGACATATC 


CAAGTAAGAC 


ATGATTAGCT 


4200 


GCATCCTCAT 


AGCCAAGTAG 


GAAATGATGG 


GAATCCTGAG 


ACAGTCTAGC 


TAGTTGGCTA 


4260 


GCCGTTTCCT 


CTGGACTAAA 


AGTATAACCC 


AAAGCCTCTT 


GGTTGATGTC 


ACATATAGCT 


4320 
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1 I LAtA 1 LAG 


TTTCTCTTAA 


ATCTCTTAGC 


ATCTCATTCC 


TCCTCAAAAG 


AAATCTTTGG 


4380 


L. AAL v_ VjAIj{_ A 


AGAATATCTT 


CTCGCTTAAT 


GGCCCCTTGA 


CGTAAGATTT 


TCACCTTGTC 


4440 


1 LLL bALAAA 


TTCAAAATAG 


TTGAATCCTG 


TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4500 


CAGAACCTCT 


TGGTCAAAAT 


CCTCTAGAAT 


TTGATTAAAG 


GTCACTCCAC 


TCGCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC 


CAATCAAGGG 


ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACTA 


GGCATCCGAA 


ATCCAACAGT 


TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4680 


TCGGTCATTA 


GCTTCGAGAA 


TAATGGTCAA 


GGGACCTGGT 


AAAAAG AT CT 


CTACAAGTTT 


4740 


TTGAAGATAA 


GTTGGCTGAT 


TCTTTGAAAA 


GTACAAGATG 


TCCTCTAAAG 


AGGCAACATT 


4800 


GAGATTGAGC 


GCCTTGTCTC 


TACGTCGACG 


TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4860 


TTCGTCTAGC 


GCCTTAGCAA 


AGAGACCGTA 


AACTGTCTCT 


GTAGGCAAAA 


CGACAGCTCC 


4920 


ACCATTTTCC 


AACTCTTGTC 


TAATCCTGTC 


CATCATCAAC 


GACAACCATC 


CTATCTTGAC 


4980 


CAAATTGGTC 


CTTGAGTGTT 


CGTACTCGCT 


TTTCAGGAAG 


ATGTTTCCTA 


AAAAGTTCAG 


5040 


GAACACTTTG 


ACCTTGCTTG 


TATCCAATTT 


CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 


AGTCTTTTGC 


ATCTTCCGCA 


ATTCTACGGT 


AAATAGCTAG 


GCCATCCTCA 


TCTGCAAAGA 


5160 


GAGC T AG AT G 


AGGCTCCGAA 


TACAAGACAT 


TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AGATATAGGG 


TGGATTGGAA 


ACAATTATAT 


CATATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CAGATTTTTT 


TAAAAATATT 


TGAAGATTTT 


GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAGCATCTTG 


GGAAATATCT 


GCTGCCGTCA 


CTGACCAATC 


TGGTCTGTTT 


TTTGCTAGAG 


5400 


CGAGAGCAAT 


AGCTCCACTA 


CCTGTTCCGA 


TATCTAGGAC 


CATAAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG 


GATAAGCTCC 


ACCAACTCCT 


CTGTTTCTGG 


ACGAGGAATC 


AAAACCCGTT 


5520 


CATCCACCTT 


TAAATGCATT 


C C AT AAAAAT 


CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG 


TAAATATCTT 


CTACAAATTG 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCCTGCTG 


GAGGGCAAAA 


ATAAAGTCTG 


TAAAAGATAG 


ATTTTTCAGA 


CTACGATAGA 


5700 


CAAAAGAGAG 


GCTTTCCGCT 


TCCTCTCCTT 


GTCTTATCAA 


CTCTTCTTCA 


AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT 


ATTTGTTTAA 


TTCTTCTAGT 


TTTTGTGTTT 


GGTCATAAAG 


5820 


CACCAAGGCA 


TCCACAACTT 


CGTCCAATTT 


ACCAGACAAA 


ATCGTATCTA 


GTTTTTGGAG 


5880 


GGTCAAGCCG 


ATACGGTGGT 


CTGTGACACG 


GTTTTGTGGG 


AAGTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC 


CGATTGTCGA 


CTTACGCTCA 


GCGTCCTGCT 


CATCTTGAGC 


6000 


AATCTGAGCA 


AAGTGGTCAG 


CAACACGGGC 


ACGGATGATT 


TTCATGGCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA AGTGAACGAT 612 0 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 63 00 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAC CTGAAACCAT 63 60 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG C G AAAAGTG C 6480 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 672 0 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGC TCC AT A AAACGCTTGG TGTCTGAAAC 67 80 

G AC AT C AGG G T C ACT C AGC A ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT AGATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 69 60 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 702 0 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 714 0 

ATCATAAGCT ATCATCACAA AACCTATAAT TG CATC ATT A TCATAAACTG CCAATGGAGC 72 00 

TACAAAATCT CCATTTTTAG TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTTGGTTGC 72 60 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 73 80 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTGTT 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7 5 60 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7 62 0 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GAT AT AG CCA AAGATATCCG 7740 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCG ACT C G AC AC AT AG C C AA 7 800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 7 92 0 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 8040 

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACG CAT CG AT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA AC AC TGG AG C 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 822 0 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 82 8 0 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 84 00 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 846 0 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 852 0 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 8 64 0 

TTAACACCCA CGGTGGGATT G CGGTG AC C A ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 87 00 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 87 60 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8 82 0 

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA T AC AC T TG C C CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 894 0 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9 000 

CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 912 0 

AGGCTATCGT AAC AG AT AT C GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GAT G AT AT CG 92 40 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGC TG AC CTAGTTCTGC 93 0 0 

TAGTACTAAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 93 6 0 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTG AC C T GCCTGAAACG ATTGAAACTT 942 0 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC AT CG AT AAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACT CT T TAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 97 80 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA C TAT AG AT AT TATGGAGCCT 9 840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 1002 0 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT T AT CTT G AAA CGACTGGTAT GC C C AC TAG A ATTCTCCTTA 1014 0 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 103 2 0 

GTAAGCTCAA TG ACTTT C AC TTTAAACATG ATTTTTCTTG TCTTCCTGAG AT TAT GT CTT 103 80 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 



(2) INFORMATION FOR SEQ ID NO : 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 6 0 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 12 0 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAAT CG AG AC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 3 60 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 42 0 

GTTTGGATTA TTACCTTGAA CGACATGCGT T AAAAGT T AG TTGAACCGCC GTATGCCAAA 48 0 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CT ACT CG AT T AACTCCCCTG 54 0 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGGGA 600 
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ACTTTTCAGA 


TATTTTTTTG 


ACT AT CT AAA 


TCTATCATTA 


GAAAAGCTTA 


GAGCGCCAAA 


660 


GGATTTGAGC 


GTTTTTCTGA 


TTTTTAAGAC 


TTTTTCCAGT 


CTCTTTTTCG 


ATTGAAGATG 


720 


TAATTATTCT 


ACTAACTAAC 


TAACTTCTTA 


GTACTAGCCA 


ACAACGATAA 


TCATAATTCC 


780 


TCCTAAAATT 


AGGAATAATA 


AAGGCAATAG 


TTTTTGTTTT 


TTCATGTAAA 


AAACCTCACT 


840 


TTTGTTTTCT 


GCTATTTTAT 


GCTAAAATAT 


TAAAAATCAA 


ATTTAATTCC 


AAAGTTTGTA 


900 


ACTAAAGGGG 


GAGCGCTACA 


TGTCTAATTC 


ATTTGTCAAG 


TTGTTAGTCT 


CTCAATTATT 


960 


TGCAAATTTA 


GCAGATATTT 


TCTTTAGAGT 


AACAATCATT 


GCTAACATAT 


ACATTATTTC 


1020 


AAAATCAGTA 


ATTGCCACAT 


CACTAGTTCC 


TATCTTAATA 


GGAATATCCT 


CTTTTGTTGC 


1080 


GAGTCTTTTA 


GTTCCGTTGG 


TTACTAAAAG 


GTTAGCGCTA 


AATAGGGTTT 


TATCTTTATC 


1140 


TCAATTTGGA 


AAGACTATAT 


TATTGGCGAT 


ACTGGTAGGA 


ATGTTTACCG 


TAATGCAATC 


1200 


CGTAGCGCCT 


TTGGTGACCT 


ATCTATTTGT 


TGTTGCAATT 


TCCATACTAG 


ATGGTTTTGC 


1260 


AGCACCCGTT 


TCCTATGCTA 


TTGTGCCACG 


CTATGCGACC 


GATTTGGGTA 


AGGCTAATTC 


1320 


AG C CTT AT C A 


ATGACTGGTG 


AAGCTGTTCA 


ATTGATAGGT 


TGGGGATTAG 


GTGGACTCTT 


1380 


GTTTGCAACA 


ATTGGTCTGT 


TACCTACCAC 


GTGTATCAAT 


TTAGTCTTGT 


ATATCATTTC 


1440 


TAGCTTTCTG 


ATGTTATTTC 


TTCCTAACGC 


TGAAGTGGAG 


GTGTTAGAGT 


CAGAAACTAA 


1500 


TCTTGAAATT 


TTGCTCAAAG 


GTTGGAAGTT 


AGTTGCTAGA 


AATCCTAGAT 


TAAGACTTTT 


1560 


TGTATCAGCA 


AATTTATTGG 


AAATTTTTTC 


AAAT AC G ATT 


TGGGTTTCTT 


CCATTATACT 


1620 


TGTTTTTGTA 


ACGGAGTTAT 


TAAATAAAAC 


GGAAAGTTAC 


TGGGGATATT 


CTAATACAGC 


1680 


ATACTCTATT 


GGTATTATAA 


TTAGTGGCTT 


AATTGCTTTT 


AGGCTATCTG 


AAAAGTTCCT 


1740 


TGCTGCTAAA 


TGGGAACCCC 


AATTATTCAC 


CCCAAATCTA 


AAAACCATCC 


AGAATCCTTG 


1800 


CCTTAGCTTA 


GATCCTGGAT 


GGTTTCTTTT 


TTCACCCAAT 


GGGTGTTTTT 


TACTAGACAA 


1860 


AAAAGAGTTT 


CCCCTTTATG 


GTATAAGTGT 


AGAAAAAAAC 


ACAAAAAGAA 


AGGAAACTCA 


1920 


CATGAACAGT 


TTACCAAATC 


ATCACTTCCA 


AAACAAGTCT 


TTTTACCAAC 


TATCTTTCGA 


1980 


TGGAGGTCAT 


TTAACCCAGT 


ATGGTGGTCT 


TATCTTTTTT 


CAGGAACTTT 


TTTCCCAGTT 


2040 


GAAACTAAAA 


GAGCGGATTT 


CTAAGTATTT 


AGTAACGAAT 


GACCAACGCC 


GCTACTGTCG 


2100 


TTATTCGGAT 


T C AG AT AT C C 


TTGTCCAGTT 


CCTCTTTCAA 


CTGTTAACAG 


GTTATGGAAC 


2160 


GGACTATGCT 


TGTAAAGAAT 


TGTCAGCTGA 


TGCCTACTTT 


CCAAAATTGT 


TGGAAGGAGG 


2220 


GCAGCTTGCT 


TC AC AG C C AA 


CCTTATCCCG 


TTTTCTTTCC 


AG AACTG AC G 


AGGAAACAGT 


2280 


CCATAGTTTG 


CGATGCCTCA 


ACCTTGAATT 


GGTCGAATTC 


TTTTTACAGT 


TTCACCAGCT 


2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 23 93 

(2) INFORMATION FOR SEQ ID NO : 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



TTTGTATCTT 


TTTAGGTCTC 


TTTCAATCCA 


AACCCTTTAA 


ACTATACGTC 


ATTTCGGTTC 


60 


CTGCAAGTCT 


TGTGGTAATT 


TTAGGTTTGA 


TTTTACTTTT 


CTTTTCACAA 


GAGCCTCTGC 


120 


ACGCTTCTTA 


TTTGATGGTC 


GTCTTCCCTG 


TTTTCCTACT 


TTTATTGGTA 


ACCAATATTA 


180 


AGAGTCAACA 


GAGGGGGCGT 


AGTGCTAGAA 


GAAGCCGAAG 


AGAAACGCCA 


TT AT GC CT AT 


240 


GGAGTCGTTT 


CTTCAAAGGA 


AATCTATATC 


TGCTAGTTTT 


TGGGTTTGTC 


TATCTTTTGT 


300 


CTGTTCCTTT 


TTTGATGAAG 


TTTGTCCTTT 


ATCCAGTACC 


TTATCAAGAA 


CGTAATCGTC 


360 


TTGCTGATTT 


GGTAAAAGAG 


GAGACAAATA 


CGGAAGATGC 


TAT CT CAT GC 


ATGGGATGAT 


420 


ACTGCGACTC 


TTTATCGTAA 


GAGTGAGCGC 


TTGTCCCATC 


GGCGATTTTG 


TCCCCGTTGC 


480 


AC TAT AC AGC 


AACTGAGGAA 


AATCGTAATA 


AGTTACTTAA 


TG ACT T G AAA 


GAAAAACAAC 


540 


CTAAGGTGAT 


TGTGGTAAAT 


GATAAGGTGG 


TAGTCTGGTC 


TGAAGTGGAA 


ACACTCTTAA 


600 


AAGAAAATTA 


CCAACAAGTA 


AAGACTGATT 


ACTCAGAGTT 


TAAAGTCTAT 


AAAATTAAAT 


660 


AACCAAATCA 


ATATCTTGTG 


T ATT T TT AAA 


AATTTTAGGA 


TTTTTAACAC 


AAGATATTGA 


720 


TTTTTCTTTT 


TAGAGTGGTA 


TAATACTTTT 


TAGAAAGAAC 


AT TT TAG AAA 


AGAGCATGCA 


780 


TATGATTGCA 


CTAGAAGAAA 


AAATTACAAT 


TTTGCCAACT 


CTCTTCGTCG 


AGAAACGAGA 


840 


TGGGAGACGT 


GTTGTATTTG 


ATGTGGACAA 


GATTGACAAG 


GCTCTCCACA 


AGGCGGCTGA 


900 


CAAGGTTATG 


GATGTGACAC 


CCCTGGTTGA 


AAAATGCCTC 


AATGATCTGA 


C TG AGC G AAT 


960 


TATTACAGAA 


ATTCATAGTC 


GCTTTCCACA 


GGGAATTAAG 


ATTTACGAAA 


TTCAAAATAT 


1020 


CGTAGAACAT 


GAACTCCTTG 


AAGCCAAAGA 


ATATGCGCTG 


GCTGAGGAGT 


ATATTACTTA 


1080 


T CGG AC AC AG 


AGGGATTTTG 


AGCGCTCAAA 


AGCGACGGAT 


ATCAACTTTA 


GTATTCATAA 


1140 


ACTTCTCAAC 


AAAGACCAGA 


CAGTTGTCAA 


TGAAAACGCT 


AATAAAGACA 


GTGATGTCTT 


1200 


TAACACTCAG 


CGTGATTTGA 


CAGCAGGGAT 


TGTTGGGAAA 


TCAATCGGAC 


TGCAAATGCT 


1260 


TCCTAAGCAC 


GTAGCCAATG 


CCCACCAAAA 


GGGGGATATC 


CACTATCACG 


ATTTGGACTA 


1320 


CAGTCCCTAT 


ACCCCTATGA 


CCAACTGCTG 


TTTGATTGAT 


TTTAAGGGTA 


TGTTGGAAAA 


1380 
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TGGTTTTAAG 


AT T GG AAATG 


CAGAGGTAGA 


GAGTCCCAAG 


TCTATCCAGA 


CTGCGACAGC 


1440 


ACAGATTTCT 


CAAATCATTG 


CCAACGTTGC 


TTCTAGCCAG 


TACGGTGGCT 


GTTCAGCTGA 


1500 


CCGTATCGAT 


GAAATTTTGG 


CGCCTTATGC 


AGAGAAGAAT 


TATCAAAAAC 


ATCTCAAAGA 


1560 


TGCAGAAGAG 


TGGGTATTGC 


CTGAAAAACA 


GGAAGATTAC 


GCTTGGAAGA 


AAGCGCAAAA 


1620 


GG AC AT C T AC 


GATGCCATGC 


AATCTCTTGA 


GTATGAAATC 


AATACTCTCT 


TCACTTCAAA 


1680 


TGGACAAACA 


CCTTTTACTT 


CGTTAGGTTT 


TGGTCTGGGA 


ACCAGTCGTT 


TTGAACGAGA 


1740 


AATTCAAAAA 


GCTATTTTAA 


ACATTCGCAT 


CAAGGGTCTT 


GGTTCAGAAC 


ACCGTACGGC 


1800 


TATCTTTCCT 


AAACTTATCT 


TTACGCTTAA 


AAGAGGCCTC 


AACTTAGAGG 


AAGGAACTCC 


1860 


C AACT AT G AC 


ATCAAGCAGT 


TGGCTCTAGA 


GTGTGCAACC 


AAGCGGATGT 


ATCCAGACGT 


1920 


CTTGTCTTAT 


GATAAGATTG 


TTGATTTGAC 


AGGTTCTTTC 


AAGGTGC C T A 


TGGGCTGCCG 


1980 


TTCTTTCCTT 


CAAGGGTGGA 


AGGATGAAAA 


TGGTGTAGAA 


GTCAATTCAG 


GTCGCATGAA 


2040 


TCTGGGTGTT 


GTGACGGTTA 


ATCTGCCTCG 


TATTGCTCTT 


GAGTCTGAAG 


GT G AT AT G AA 


2100 


TAAGTTCTGG 


GAAATCTTCA 


ACGAGCGAAT 


GAATATCGCA 


GAAGATGCTC 


TTGTTTACCG 


2160 


TGTCGAACGC 


ACTAAAGAGG 


CGACACCAGC 


GAATGCTCCT 


AT T CT TT AT C 


AGTACGGTGC 


2220 


TTTTGGCCAT 


CGTCTAGGTA 


AAGAAGAAAG 


TGTTGACCAG 


CTCTTTAAGA 


ATCGTCGTGC 


2280 


GACCGTTTCG 


CTGGGCTATA 


TCGGCTTGTA 


TGAAGTAGCG 


ACAGTTTTCT 


TTGGTAACAG 


2340 


CTGGGAAAGT 


AATCCAGATG 


CTAAGGAATT 


C AC G CT AG AC 


ATCATTCACG 


ATATGAAACG 


2400 


CCGTGTAGAA 


GAGTGGTCAG 


ACCAATATGG 


CTACCATTTC 


TCTATCTACT 


CAACACCATC 


2460 


CGAAAGTCTG 


ACAGACCGTT 


TCTGCCGACT 


AGATATAGAC 


AAGTTTGGCT 


CTATTCCTGA 


2520 


TAT C AC AG AC 


AAGGAATACT 


ACACCAACTC 


TTTCCACTAC 


GATGTTCGTA 


AAAATCCAAC 


2580 


ACCGTTTGAA 


AAATTGGACT 


TTGAGAAAGT 


CTATCCGGAA 


GCAGGTGCGT 


CAGGTGGTTT 


2640 


CATCCATTAT 


TGTGAGTATC 


CAGTCCTTCA 


GCAAAATCCA 


AAGGCCTTGG 


AAGCTGTCTG 


2700 


GGATTATGCT 


TATGACCGTG 


TAGGCTATCT 


AGGCACCAAT 


ACTCCGATTG 


ACCGTTGCTA 


2760 


CAAGTGTGAC 


TTTGAAGGGG 


ATTTTGAACC 


AACTGAGAGA 


GGGTTTGCTT 


GTCCAAACTG 


2820 


TGGCAATAGC 


GACCCTAAAA 


CAGTAGATGT 


GGTGAAACGA 


ACTTGTGGCT 


ACCTAGGTAA 


2880 


TCCTCAAGCA 


AGACCGATGG 


TCAACGGGCG 


TCACAAGGAA 


ATCGCTGCGC 


GTGTCAAACA 


2940 


TATGAATGGT 


TCAACGATTA 


AAATAGCTGG 


GCATCAAGTA 


ACAAATTAGA 


AAGAAATGAA 


3000 


ATGGGAAAAT 


ATCAACTAGA 


CGATAAGGGG 


CGCGCACAAG 


TGACCCGTTA 


T C ACG AG AAA 


3060 


CACTCTAAAG 


GTGGAGCTGG 


TAAGAAAGAA 


CGCTTGCTTA 


G CTT C AG AG A 


ACAATTTTTA 


3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180 

AAGGATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTGTTTTAG ATATGATGAC 324 0 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300 

GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3 3 60 

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 342 0 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 

CATTCGTCCA TCTGAAAGAG G C AAGGGTT A TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 3 54 0 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3 600 

T AG C AG AG C A GT C AT T C TAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 3 6 60 

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TGTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG C AG ATT AT GG 3 900 

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3 960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4 02 0 

TCTGGTCCTG GACCGGCTAC ACT TGGG AAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG AT GG AAG AT A TGATCGAACT AAGAGAAATC 414 0 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 42 00 

AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 42 6 0 

TGAAGAGAGA ATGAAGAAAA AGGACTTAGT AGACCAACTA G T C T C AG AG A TCGAGACGGG 4320 

GAAAGTCAGG AC AC TGGG AA TATACGGTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4 380 

GG AAT T GT AC CAAGCTTTAG ATT CT ACT AC AGTAAATTTG C TAG AG AC AG ATCCTTATAT 444 0 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGG TGACAGCCAG 4 500 

TCTGCCAGTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4 560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680 

AC C AT CTGTT TCTACACGGA TGAGGAGACC GAATT AAAG C GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 47 62 
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GATGCAGGTT TCGACCCACA TATTCCAGAA AATTACTTTA AAGATGATGA TGTTAATCAG 60 

GTACCTTGTC TTTGTTGGTC TTCATCTGCA GCCCTCTTTT TCAGTAATTG GGTAGACCAT 12 0 

GCGGTCTATC AGGAGACGCC TTTTGATTGG AGAAAGATAG AAGATGATGC ATCTGCATAT 180 

GGGT AT TT AT AAGAGGAATT ATGACATATT TAGACGCTTT TAAATCAGGT ACCTTGGTTT 2 40 

TACCGAGTGC CCTGCTCTTG CATTTTAAGG AACTCTTTCC TTCTAGCGAC GATTTTCTGG 300 

TTTGGCAATT TTTCTATTTG CAAAATACGA CAGGCTTAGA AGAAATGTCG CCAAGCCAGA 3 60 

TTGCTGAAAG GATTGGCAAG GAAATTTCGG ATGTCAACCA GTCCATTTCT AATCTGACGG 42 0 

AAAGGGGACT GCTCCAGTAT CGT ACT AT C G AATTAAATGG CGAAATTGAA TTGCTCTTTG 4 80 

ATGCTAGTTT GGCCTTGGAA CGTTTGGATG ACCTGTTTGG AGCAGTTCAT TCAAGTTCAG 54 0 

ACCAGCTAAC ACCTCAAAAC CAGCTCAAGG ATTTGGTGGA AACCTTCCAG CAGGAGTTGG 600 

GACGATTGTT GACGCCTTTT GAGATTGAGG ATTTGACCAA GACACTAAAG GAAGATGGAA 6 60 

CCAGTGCTGA CTTGATTAAG GAGGCTCTTC GTGAAGCTGT TTTGAATGGA AAACCAAACT 72 0 

GGAAGTACAT TCAGGCGATT TTGAGAAACT GGCGCCATGA AGGAATCAAG AGTGTGGCTC 7 80 

AAATTGAGGC CAAGAGAGCA GAAAGAGAAG CAAGCAATCC TCAGTTGACA CAGGTATCTG 84 0 

CAGATTTCAT AAATGCCATG GATCTCTGGA AGGATTAATC CATGCAAGTA GGCTTGAAAT 9 00 

CCGAGTAAGA TTTGCAAGCT GTGTATAATT GTGATAGAAT AAATAGAAAA TAAATTGAAA 960 

AAAGAGGTAT GTGAAATGTC ACGTAAACCA TTTATCGCTG GTAACTGGAA AATGAACAAA 102 0 

AATCCAGAAG AAGCTAAAGC ATTCGTTGAA GCAGTTGCAT CAAAACTTCC TTCATCAGAT 1080 

CTTGTTGAAG CAGGTATCGC TGCTCCAGCT CTTGATTTGA CAACTGTTCT TGCTGTTGCA 114 0 

AAAGGCTCAA ACCTTAAAGT TGCTGCTCAA AACTGCTACT TTGAAAATGC AGGTGCTTTC 12 00 

ACTGGTGAAA CTAGCCCACA AGTTTTGAAA GAAATCGGTA CTGACTACGT TGTTATCGGT 12 60 

CACTCAGAAC GCCGTGACTA CTTCCATGAA ACTGATGAAG ATATCAACAA AAAAGCAAAA 132 0 

GCAATCTTTG CGAACGGTAT GCTTCCAATC ATCTGTTGTG GTGAATCACT TGAAACTTAC 13 80 

GAAGCTGGTA AAGCTGCTGA ATTCGTAGGT GCTCAAGTAT CTGCTGCATT GGCTGGATTG 1440 

ACTGCTGAAC AAGTTGCTGC CTCAGTTATC GCTTATGAGC CAATCTGGGC TATCGGTACT 1500 
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GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 15 60 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTGTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 16 80 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATTCA GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGGC AACTAGTCAT 18 60 

GTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 192 0 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT 19 80 

GCTGCTGCCC ACTGGGATGG GG ATT AT TAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2 040 

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 222 0 

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 22 80 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 234 0 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT AT C T AAAATC CGATGGAACT 2 4 00 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 24 60 

TATATGGCTC GGAATGAGTG GCAAGGCAAC TACTATTTGA CTGGAAGTGG TGCCATGGCG 2 52 0 

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGGCTGGGTT CACAGAGATG GTAAGCGCTA TTTCTTTAAT 264 0 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TC ATT GAT AT TAGTGAGCAC 2 7 00 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2 82 0 

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TG AG AC CG AT 2 880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2 940 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3 000 

AGTGATACAG GCACTTGGGT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT AT T C AGG AAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATGTG 3300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3 360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 34 8 0 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3 54 0 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3 600 

G AC TAT AAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3 6 60 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 372 0 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 37 80 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3 832 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

TGAAAAAATC CTCATGAACC TGGCGCCAAT AG AC AAGT GT CTTGTTTCCC TCACCTTCCT 60 

TATAGGCATG GTCAGCTGAC ACTCGATTGA AGGGTTTAAC AGAAACCTTT GTAATTTCGA 12 0 

CAATGCAGAC AGCCTGATTT TGACTATCTA AAATGACATC GAAGGTCCCT ACTTGGGGAA 180 

GTGGTTCGTC TTCTAGCACA TAGAGGTCAT AGGCTGATGC TGTTGCTGTC TTTTCTCCTT 240 

TAAACACCAA ATCCGCTAAA AGGTCTGGTT CAACTCCAAA AGCCCAGGCA TCGATTTCAT 300 

CTCCGATCAA AGGATTGATT TGCTTGTATT TATTCCACAT TTCTTGCGGT ATCATGGGTG 360 

CTCCTTTGTA ATTTTTTACT TTCTTCTTTT ATGTGTTTAA GATGATCTGG ATGGTCAATC 42 0 

TCTAAATCAA AAATCTCTGG AATAGAACTG TAGTGGATAA TGCACTTGAT AC C C AACTG A 480 

TTCATTTTTT GTATGAAAGA AGTATTCAGA TAGCCTGCTA CAGCAAAATC AATCTTGTTC 54 0 

TTTCTTGCTT TATCCTGCAT ATCTCTTAGC ATATCTAACA TTATTGGACT TTCCATATCA 600 

TGCCATTGAC TGTTTCTCAT AGTCGCAAAA ACAAAGGAAG TC AAAT CAT T CATTCCAACT 66 0 

ACAATCTTTG AAATGCCCGT TTCCAGTATA CTAGATAAGT CAAAATACGC TGACGGTAAT 720 

TCAATCATCG TTCCGACTTT CCCAGTAAAA CCCTGCTGAC GCAATACTGT AATAGCTTGT 7 80 

TTTAATTGGT CGGCATCATT G AC AAAAG G A AAGATAACAG AT AG AT T GGG GTTGGTTTGA 84 0 
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TAAACTTCTG TAACGACATG TGCTTCAGCC TGAAATTCAT CCAAACACGC CAGTAAACGC 900 
CTAGTTCCTC TATAGCCAAA CAAGGGATGC CCTTCGTCAA AAAACTCTTT AGTCCCCACT 960 

AAACAATTGG CTTCTGTATT CGTTAATTCA GTAAAACGAT ACCAAACTTC CTTACCTAAG 102 0 

T AAAAGG AG C AAATAGTATC AAGATAATCT TTCACAAATT CCTGACAACT TTGTAATAGT 1080 

ATATTTTGAT TGAGCTCTCT CAATAAGTAT TCCCCACGAA TCATGCCGAC GTGGTGAAAT 1140 

AGTTGAGGAT AAATTTTTTC AAGAATTTTT TCGCCACTAA GGGCAAGTTG ATTTCTCATC 12 00 

ATTCACCTTC CAATTCATGT AAGAAGTCTT GTCCAGTTCT GGAAATCCTA ATAATTCAGA 12 60 

CTTAACCTTC AAGACTAATG GCGATGCATT TTCTTCTGTA ATCTCTTGAA TATCCATCCA 132 0 

AATATATCCA AGTGAATCAT TCGCACCATC AG AC AC AG CT TCCGAAATCG TAACTTGAGG 13 80 

TGCACTCTCA TTCATTTCAA CAT CAT AC AA GG CT AT G AC A TGGTGAACCA TAAAATTTTT 1440 

TAACTCTTCC CTGACGAAAA CATCGTAGAT TCGAGGATTA GAGTAGCTTC TAACAGTAAA 15 00 

TCCCGTCTCT TCCATAACTT CTCTAGTCAG CGTTTCCGTC AGTCCTTCAC CAAGTTGCTG 15 60 

ACTGCCTCCA GGTAGATCAT ACCGATGTTG ATAAGGGCCT CTCGTTTTTT CAATGCAAAG 162 0 

TAACTTTCCA TTTTCAAAGC AAACACAGTA GACCCCAAAG TGATTTTTGA TTTCCATCCA 16 80 

ACTCCTCCTA CTTCAAAGAC CAGCCACCAT CTATTGTCAA GATTTGTCCT TGCATGGCGC 1740 

TCGCTTTTCC ACTTGCTAAA AAAAG AC T AA GCTCTGCTAT TTCCTCTGGC TCAATCCAGC 1800 

GCTTGATTGG GGTTTCACTA GCCACCCAGT CAGCCAAACC ACCTGGTTCA AAATCCGCAG 18 60 

CGGTCATAGC TGTCTTGACT GCTCCTGGAG CGATACCAAA GACCTGAATC CCAGCTTCAG 1920 

CATAGTCTAG AGCCAACTGC TTGGTGAAGC CAGCCAAGGC ATGCTTGGAT GAAGTATAGG 1980 

CGTGACCACC TCCACCTGCT AGGCTAGAAG CAATGGAACA C AT ATTG AT G ATGATTCCCT 2 04 0 

TTTTATTTTC CAGCATTTGT GTCAAATAAT ACCGAGTCAA CTCTACTGGA ATAATGTAGT 2100 

TGATTTCAAA AATCTCTTGA ATGTCCTGCG CCGTTTGTTC CAACAGTGGT TTGTAATCAT 2160 

CCAAAACTCC AG C AGT ATT A CACAAAACAT CCACCTGAGG GCACCAGTCA AAAATAGGTT 2 22 0 

CCAAGTCCAA GGTCAAATCT CTCTGTAAAA AGCGAAAATC ACCCTCTAAG AGTGGCTTTT 22 80 

CACCTTGGTC AACT C C AT AA ACTTGATAGC CCTTCTCTAA AAAGAGGCGA GCTTGAGCCA 2340 

ATCCGATCCC TGAACTCACT CCTGTAATGA GTACACGTTT AGTCATGCAC TTCTACCCAA 2400 

TCCGTTGCCA AAAC AT C AC A AACTGTCGGG CTCCACATGG AAAAACCTTC TCCTTCGCCA 24 60 

GAAACGTTGA TTAGGAAATA AGGTGTCATT TCAAGTGCAA GCCCATTTTG CTCGATGGTA 2520 

TCAAAGAGTT GGACATAGTT TTCCGCACCT CCCCAACCAG TTCGTACATA TTTTCTCTTA 2580 

GCCTTTAACC CAGGCAGGAT CTCTTCAAAT GTCATGTTTT TCTCCTTTAA TT CT AC ATT C 2 640 
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TTCATTTAAT TATAGCAAAA AACCGCTTTA TACGGCTTTT TGAATGTGAG TTATTCAAAC 2 7 00 

C TG CT ACT AC TTACGGCAAA TTATTCCCTG CAGCAAGATA AAT T T CAT AC CATTCTTTTC 27 60 

TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 2 82 0 

CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA AGAAATGGCA ATAGTTGAAG 2 8 80 

AGGTTACTCC ATATTTAATA G C T AAACG AT CAAGTACTTG ATTTAAAGCT TGAAATTTCT 2 9 40 

CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA G AC AG AC CAT GCTTGAATGA 3 000 

CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC TCGCATAGCT GCTTGACTAT 30 60 

CTTCCATATT AACATGAAAA GCTGATTCAA ATCCTGGAGT AAAAGCCGCA CTCAATTGTA 312 0 

GCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG CAACTCCATC AT C AT AGG AT 3180 

TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 324 0 

CTGCTACTTG GTCAGATTCC ATCAAAGCAT CTGGTCGATG AAGGAGCAAG CT AT C TAG AT 3 3 00 

GAT C AAT C TT CAATCTTTGC AAAAT AC CGT CTACTGATTT TAT AAT AT AG TCCTTAGAAA 3 3 60 

AATCAAAATA GGTAAATTCT TCAATGCGAA TGCCACATTT GGACTGAATC CACATCTTTT 3 42 0 

CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG TTCTTCACAA CGACCACGAC 34 80 

C AT AAAT AT C AGCCAAGTCG AAGGCATTGA TTCCAACAGA AAGTGCTGTT TCTACAAGCT 3 540 

CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 3 600 

ATTCTTTGTC ATCTTGACCA AGAGTTATGT ATCTCATCAA ATTTTTCTCC TTTAATTTCT 3 6 60 

AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA CGACTTTTTG AC T AT ACTTC 372 0 

ACTCCATTTT ATCTTCTTAA AC C C ACGGAA CAAGACAAAG ATTCCAATAA AG AGG AC AG C 3 780 

TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC ATCCACTCAT AATAACGGAG 3 84 0 

CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA AATGGACGAA AGACCGCTTC 3 900 

TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG ACAGACAAAC TATCCCAGGG 3 9 60 

AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT ATCCTAAAAC 4 02 0 

AACTAGAAGA ACT AT AGTC C CAACAACAAT GTAAGTGCCA ATTTTCATTT TAGGAGAATC 4080 

TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA AAT C C AAT C A GAAAAATAAG 4140 

AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA GGACCTTTAG AAAAATCACT 42 00 

TAGTAGTTGA TAATAACGTA ATACCGCCAG GACAAGAATT GGCGTCAAAA GGGACTCTTT 42 60 

G AT AGAAC TG CGAGGTGCTC C C T TG AG AAT CTCTTTCATT ATTTTTTTAG GATTCTTACC 4320 

TAG AT AAT CC TCTGCACTCA TGCCATCTCG TTCTGCTTCT GAGAAATCTA GCATCATCAA 43 80 
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ATAGATCTGC TCTCTGAGAT AGTCTT C AT C ATAGAGAAAT CCAGCAAGAT TAAAACTTTC 4440 

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTGT 4500 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCCCCA TTCTTAATCC CTTCTACTTT 45 60 

TTGACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 462 0 

TAATGAAAAA TACTTCCGAT CTGGACCATC TGGCGACGGG CGCATGTCGC CTCTTATCCA 4680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 474 0 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4800 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 48 60 

CTTCTAATCT ATTTTGTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 4920 

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4 980 

GCCCTACCGT ATGTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTGACTTCG 5100 

TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 522 0 

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAATAA 5280 

AAAAACAGAA CTAGCCTGAA CTAGTCCTGT CTACTTTTAC CCAATCACAC TTCCATTTGG 5340 

TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 54 00 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTAGCAA CGATTTGAAC 5460 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5 52 0 

ACGATCTTCT CCATCACCAG CAT C C AAGCG GAATTGAAGC AACTTATCTG AACCTTCTAC 5580 

T TT AG AC AC T TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 564 0 

GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 57 0 0 

TGGTTTATTG CCTTCCATTT GTTCCTTGAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 57 60 

TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 5820 

CAAGTTTTCA AG AC TAG AAA CTTCTTCCAA AC C AAGTTG A GTCAAAACTG CACGACTAGT 5880 

TT C CAT CAT A AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT C ACGAAG AG C TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTG CG ATT GAT TGTwCTG CAACCTCAGC 612 0 

AAGAAC AT G A TC AT AT T C AG TCACACCTTC TACATAGGCA GGGATTTGTC CATCAAAGTA 6180 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 6240 

GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCG TCTGAACCAA CTGGAAGGTT 6300 

ACGCATGAGG TAGTAACGAA GTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 63 60 

AACG AC AT T C CCTTTTGACT T AG AC AT TT T TCCGTCTTTC ATGACAAACC AACCATGGGC 6420 

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 6480 

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 6540 

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 6600 

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6660 

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 672 0 

AAGGCGACCA T C TGGCGTG A TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 6780 

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA Ac CC ATT C AA CCTCATGACC 6840 

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 69 00 

GCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 69 6 0 

GATATCATCT TGAGCAAGTA AG CGTT C AAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 702 0 

ATCAGTTGTA C GG AT AAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080 

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 7140 

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 7200 

GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCAGAAC CGATATGAAG 7260 

T TT C C C AG AT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 732 0 

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 73 80 

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 7440 

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGGCTG GCAAGACTCC 7 500 

AAAAAT C AT A C C AAC ACT AG CCGAAACTGC AAGACTAAAT AGGGCGACTG GG AT TG AT AC 7 5 60 

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7 62 0 

ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATC AT GG ATTCAATCAA 7680 

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 7740 

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7800 

AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 7 8 60 

TGCAAATACA ACGGACTCAT CTGCCACCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7920 
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TGTCATTTTT 


CGTGCCAGTT 


CTGGACCCAG 


AGTTGGGGTT 


AAACTGGTAT 


CATTCACTCG 


7980 


AAAGACAATA 


TTAGCTATTT 


CATCTACATT 


AAAATTCGCA 


GCAAGGGAGA 


TATTGGTAGT 


8040 


AATAGGCAAG 


CCACCAAACC 


CATATATTTT 


TGATCTTTTA 


GCCTCCGGAC 


TAGTATAAAC 


8100 


CCCAATGACC 


CGGTAACTAA 


ATCCATTGAC 


TTCTACAACC 


TTGTTAATAG 


CCTCTTGAGG 


8160 


AGATTCAAAT 


AAACTAATGG 


ACAATTCCTC 


AT CT AGC AAA 


ATGACACTTG 


CAAACTCTTT 


8220 


GAAATCTTGC 


TCTCTCAGAC 


TACGACCTGC 


AATAATTTCA 


TTCTTAACAG 


CGTCCATGTA 


8280 


AGTTCTGTTT 


CCACCTGTCA 


AATTAGCATT 


CTCAACCTTT 


TTATCTTGAT 


AGGTCAAGAT 


8340 


GGCATTCGTT 


GAATTGGTTA 


CATAGTAACT 


ATCCACTCCC 


TTCAGTTTAG 


CTGCCTCTTG 


8400 


GACCCAGGAT 


TCTTGCGGTT 


TTGGCGGTTC 


AACAGGAACT 


TCCTCTTCCT 


TTCCAGAAAC 


8460 


CGTAAAAGCT 


GATTGTTTCT 


GAGTAAAAGA 


CCCGTCTTTA 


CTTTTTTTAG 


GAGAGAAAAA 


8520 


GACG CTAAT A 


TTTTTCTGAG 


ATTTAGTCAT 


ATCTTTATTG 


ACTTGACGAG 


ATAGGGAATC 


8580 


ACCCAAAGCC 


ATAATCACAA 


CAACTGATGA 


AACACCGATA 


ATAATCCCAA 


TCATAGTAAG 


8640 


CAAAGAACGC 


ATCTTGTGAG 


CCATGATAGA 


TGAAAAGGCA 


AATT T C AG AT 


TCTGCATCTT 


8700 


AGTTTTCCTC 


CTTTCCTAAC 


T G AG C AC T GT 


CAGACGAAAT 


GACCCCATCC 


CGAATGACAA 


8760 


TCTGACGTTT 


GGCATAGGCA 


GCAATCTCAG 


GCTCATGCGT 


TACCATGATA 


ATGGTTTTTC 


8820 


CTTCTTTATT 


CAAATCAACC 


AATAATTGCA 


TAATTTGGTT 


ACCTGTTTTG 


GTATCCAAGG 


8880 


CTCCTGTCGG 


TTCATCCGCT 


AGGATAATAG 


AAGGATTGTT 


TACCAAGGCA 


CGCGCAATGG 


8940 


CTACACGTTG 


CTTTTGACCA 


CCAGATAATT 


CTGAAGGTAA 


ATGGTGACTA 


CGTTCTGTCA 


9000 


ATTCAACCTT 


GTCTAAATAT 


TCCTCAGCCA 


ACTTGCGACG 


TTTTGAAGAC 


GAAACTCCTG 


9060 


CGTAAATCAA 


GGGCAATTCT 


ACATTTTGCA 


GAGCATTGAG 


CTTCGATAGA 


AGAAAGAACT 


9120 


GCTGAAAGAC 


AAAACCGATT 


TGTTGGTTAC 


GGACCTTAGC 


TAGTTGTTTT 


TCACCAAGCC 


9180 


CAGCCACTTC 


TTGACCTTCA 


AGATAATATT 


CTCCACTGGT 


TGGTGTATCC 


AACATGCCAA 


9240 


TCGTATTCAT 


CAGAGTGGAC 


TTACCAGACC 


CAGATGGTCC 


CATGATGGCT 


ACAAATTCAC 


9300 


CCTCATTCAC 


TTCTAGATTG 


ATATTTTTGA 


GAACCTGCAG 


TTCTTGGTCA 


CCATTACGGT 


9360 


AACTTCTGAA 


GATATTTTTT 


AGACTAATTA 


GTTGCTTCAT 


CAGCCTTCAC 


CTCTTTTCCT 


9420 


TCTTCCAAGG 


AAGATGTTGG 


AT T ACTGATG 


ACCTTAGCAC 


CGTTCGTTAA 


ACCAGAAGTG 


9480 


ATTTCTTGAT 


TTTCTGCGTC 


AGCATTTCCC 


AATGAAACCT 


CAACTTTTTT 


AGCCTTTTGT 


9540 


TGTTCATCCA 


CAATCCAGAC 


ATAATTTTTA 


CTATCATCCA 


TTACTAGACT 


GCTAACAGGA 


9600 


ACAAGAATAG 


CCTTAGTTTT 


GCTTTTAACC 


TCAATGTTGA 


C AG AAAAAC C 


TTGTTTCAAA 


9660 


TCACCAACCT 


CGCCTGTCAC 


ATCAATAGTA 


TAAGGGTATT 


TAGAACCTGT 


ATTATTCCCG 


9720 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 9 7 80 

CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTG ACCTACAGAA 9840 

AGGTTGGCTA G ATTGT ACT C AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 

ATATGAACCA T AAC T TG ACT CGCCCCTGTT GG AG AT T T AG AAACATTGCT ATTGACTTCG 99 60 

ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTGCAT C C AATTG AC T TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCC AATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 10140 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 102 00 

T CAT TG AT AT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 102 60 

GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 1032 0 

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 103 80 

GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT T AG AGCAGT C TGAGAAGGTT GTCTAAAGAG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 1056 0 

GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 1062 0 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10 680 

TGCTTTTCGG 10 690 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 120 

TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG AT AAC CGT AT 3 00 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 360 
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AGTGAGCGAG 


GAAGCGGAAG 


AGCGCCCAAT 


ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


420 


GATTCATTAA 


TGCAGCTGGC 


ACGACAGGTT 


TCCCGACTGG 


AAAGCGGGCA 


GTG AG CGCAA 


480 


CGCAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


GGCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


540 


GGCTCGTATG 


TTGTGTGGAA 


TTGTGAGCGG 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


600 


CaTGATTACG 


AATTCGAGCT 


CGGTACCCGG 


AAAATCCAGA 


AAATGCTTGA 


AAAAAATCCT 


660 


AGAAGATGGT 


ATAATACTAA 


ATTGTAAGGG 


TT AT C AC AT A 


TAACTCAAAA 


AAAGAAAGAA 


720 


CAAAAGGAGA 


GTCAAACTAT 


GGCTTCTAAA 


GATTTCCACG 


TAGTGGCAGA 


AACAGGTATT 


780 


CACGCACGTC 


CAGCAACATT 


GTTGGTACAA 


ACTGCTAGCA 


AATTTGCTTC 


AGATATCACT 


840 


CTTGAGTACA 


AAGGTAAATC 


AGTTAACCTT 


AAATCAATTA 


TGGGTGTTAT 


GAGTCTTGGT 


900 


GTTGGCCAAG 


GTGCTGACGT 


AACTATCTCA 


GCTGAAGGTG 


CAGATGCAGA 


TGACGCTATC 


960 


GCTGCAATCT 


CAGAAACAAT 


GGAAAAAGAA 


GGATTGGCAT 


AAGGGAAATG 


ACAGAAATGC 


1020 


TTAAAGGAAT 


CGCAGCATCT 


GACGGTGTTG 


CAGTTGCAAA 


AG CAT AT C T A 


CTCGTTCAGC 


1080 


CGGATTTGTC 


ATTTGAGACT 


ATTACAGTCG 


AAGATACAAA 


C GC AG AAGAA 


GCTCGCCTTG 


1140 


ATGCCGCTCT 


ACAGGCATCA 


CAAGACGAGC 


TTTCTGTTAT 


TCGCGAGAAA 


GCAGTAGGTA 


1200 


CGCTCGGTGA 


AGAAGCAGCT 


CAAGTTTTTG 


ATGCTCACTT 


AATGGTTCTT 


GCTGACCCAG 


1260 


AAATG AT C AG 


CCAAATCAAG 


GAAACTATCC 


GTGCG AAGAA 


AGTGAATGCA 


GAAGCAGGTC 


1320 


TGAAAGAAGT 


TACAGATATG 


TTTATCACTA 


TCTTTGAAGG 


CATGGAAGAC 


AACCCATACA 


1380 


TGCAAGAACG 


CGCAGCGGAT 


WTCCGCGACG 


TGACAAAACG 


TGTATTGGCA 


AACCTTCTTG 


1440 


GTAAAAAATT 


GCCAAACCCA 


GCTTCTATCA 


ATGAAGAAGT 


GATTGTGATT 


GCGCATGACT 


1500 


TGACTCCTTC 


AGATACAGCT 


CAATTGGACA 


AAAACTTTGT 


AAAAGCTTTT 


GT AAC C AAC A 


1560 


TTGGTGGACG 


T AC AAGC C AC 


TC AG C TAT C A 


TGGCACGTAC 


ACTTGAAATT 


GCTGCTGTAT 


1620 


TAGGTACAAA 


T AAC AT C ACT 


GAAATCGTTA 


AAGACGGTGA 


CATCCTTGCT 


GTTAACGGGA 


1680 


T C AC TGG AG A 


AGTGATTATC 


AACCCAACAG 


ATGAACAAGC 


GGCAGAATTT 


AAAGCAGCTG 


1740 


GTGAAGCCTA 


TGCGAAACAA 


AAAGCTGAAT 


GGGCACTTTT 


GAAAGATGCT 


CAAACAGTGA 


1800 


CTGCTGACGG 


TAAACACTTC 


GAGTTGGCTG 


CTAATATCGG 


TACTCCAAAA 


GACGTTGAAG 


1860 


GTGTTAACAA 


CAACGGTGCA 


GAAGCTGTTG 


G AC T TT AC C G 


TACAGAGTTC 


TTGTACATGG 


1920 


ATTCTCAAGA 


CTTCCCAACT 


GAAGATGAGC 


AGTATGAAGC 


AT AC AAGG CT 


GTTCTTGAAG 


1980 


GAATGAACGG 


TAAACCTGTT 


GTCGTTCGTA 


CAATGGATAT 


CGGTGGAGAT 


AAGG AACTT C 


2040 


CTTACTTCGA 


TATGCCTCAC 


GAAATGAACC 


CATTCCTTGG 


ATTCCGTGCT 


CTTCGTATCT 


2100 


CTATCTCTGA 


GACTGGAGAT 


GCTATGTTCC 


GCACACAAAT 


CCGTGCTCTT 


CTTCGTGCGT 


2160 
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CTGTTCACGG TCAATTGCGT ATCATGTTCC CAATGGTTGC GCTCTTGAAA GAATTCCGTG 222 0 

CAGCGAAAGC AGTCTTTGAT GAAGAAAAAG CAAACCTTCT TGCTGAAGGT GTTGCAGTTG 2280 

CGGATAACAT CCAAGTTGGT ATCATGATCG AGATTCCTGC AGCGGCTATG CTTGCAGACC 2 34 0 

AATTTGCTAA AGAAGTTGAC TTCTTCTCAA TTGGTACAAA CGACTTGATC CAATATACAA 2400 

TGGCAGCAGA CCGTATGAAC GAACAAGTTT CATACCTTTA CCAACCATAC AACCCATCAA 24 60 

TCCTACGCTT GATTAACAAT GTGATCAAAG CAGCTCACGC TGAAGGTAAA TGGGCTGGTA 2 52 0 

TGTGTGGTGA GATGGCTGGT GACCAACAAG CTGTTCCACT TCTTGTCGGA ATGGGCTTGG 2 580 

ATGAGTTCTC TATGTCAGCA ACATCTGTAC TTCGTACACG CAGCTTGATG AAGAAACTCG 2 640 

AC AC AG C T AA GATGGAAGAG TACGCAAACC GTGCCCTTAC AGAATGCTCA ACAATGGAAG 27 00 

AAGTTCTTGA ACTTCAAAAA GAATACGTTA ATTTTGATTA ATCGAAAAGT CCCTGCAACT 27 60 

CAGTTACAGG GATTTTTTTG AT ATT TT AAA AAGAATTTTC AAGAAAATCT TT CTT AT AG A 2 82 0 

AAGTCCAACC TTGAAAAAGT AGTGGTCAGA ACAAAAAATA CTTAAATGGT T C AT AAAATT 2 880 

CTTGACAAGT TGG AT AT T T A GGAGTAAACT ATTAACCAGT TAAGTAATAG AGAGGAGTTT 2 94 0 

CTGCAATTTA GAAATGAATT GCAACTAGAA ATATCAAATA GAAAGAGAGT TTCGATGAAA 3 000 

ATTAATAAGA AATACCTTGT TGGTTCTGCG GCACTTTGAT TTTAAGTGTT TGTTCTTACG 3060 

AGTTGGGACT GTATCAAGCT AGAACGGTTA AGGAAAATAA TCGTGTTTCC T AT AT AG AT G 312 0 

GAAAACAAGC GACGCAAAAA AC GG AG AATT TGACTCCTGA TGAGGTTAGC AAGCGTGAAG 3180 

GAATCAATGC TGAGCAAATC GTCATCAAGA T AAC AG AC C A AGGCTATGTC ACTTCACATG 3 24 0 

GCGACCACTA TCATTATTAC AATGGTAAGG TTCCTTATGA CGCTATCATC AGTGAAGAAT 33 00 

TACT CAT G AA AGATCCAAAC TATAAGCTAA AAGATGAGGA TATTGTTAAT GAGGTCAAGG 3 3 60 

GTGGATATGT TATCAAGGTA G AT GG AAAAT ACTATGTTTA CCTTAAGGAT GCTGCCCACG 342 0 

CGGATAACGT CCGTACAAAA GAGGAAATCA ATCGACAAAA ACAAGAGCAT AGTCAACATC 3 480 

GTGAAGGTGG AACTCCAAGA AACGATGGTG CTGTTGCCTT GGCACGTTCG CAAGGACGCT 3 54 0 

ATACTACAGA TGATGGTTAT ATCTTTAATG CTTCTGATAT CATAGAGGAT ACTGGTGATG 3 600 

CTTATATCGT TCCTCATGGA GATCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA 3 660 

GCGAGTTGGC TGCTGCAGAA GCCTTCCTAT CTGGTCGAGG AAATCTGTCA AATTCAAGAA 3 72 0 

CCTATCGCCG AC AAAAT AG C GAT AAC AC T T CAAGAACAAA CTGGGTACCT TCTGTAAGCA 3780 

AT CC AGG AAC TACAAATACT AACACAAGCA ACAACAGCAA CACTAACAGT CAAGCAAGTC 3 840 

AAAGTAATGA CATTGATAGT CTCTTGAAAC AGCTCTACAA ACTGCCTTTG AGTCAACGAC 3 900 
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ATGTAGAATC 


TGATGGCCTT 


GTCTTTGATC 


C AG C AC AAAT 


CACAAGTCGA 


ACAGCTAGAG 


3960 


GTGTTGCAGT 


GCCACACGGA 


GATCATTACC 


ACTTCATCCC 


TTACTCTCAA 


ATGTCTGAAT 


4020 


TGGAAGAACG 


AATCGCTCGT 


ATTATTCCCC 


TTCGTTATCG 


TTCAAACCAT 


TGGGTACCAG 


4080 


ATTCAAGGCC 


AGAACAACCA 


AGTCCACAAC 


CGACTCCGGA 


ACCTAGTCCA 


GGCCCGCAAC 


4140 


CTGCACCAAA 


TCTTAAAATA 


GACTCAAATT 


CTTCTTTGGT 


TAGTCAGCTG 


GTACGAAAAG 


4200 


TTGGGGAAGG 


ATATGTATTC 


GAAGAAAAGG 


GCATCTCTCG 


TTATGTCTTT 


GCGAAAGATT 


4260 


T AC C ATCTG A 


AACTGTTAAA 


AATCTTGAAA 


GCAAGTTATC 


AAAACAAGAG 


AGTGTTTCAC 


4320 


ACACTTTAAC 


TGCTAAAAAA 


GAAAATGTTG 


CTCCTCGTGA 


CCAAGAATTT 


TATGATAAAG 


4380 


CATATAATCT 


GTTAACTGAG 


GCTCATAAAG 


CCTTGTTTGA 


AAATAAGGGT 


CGTAATTCTG 


4440 


ATTTCCAAGC 


CTTAGACAAA 


TTATTAGAAC 


GCTTGAATGA 


TGAATCGACT 


AATAAAGAAA 


4500 


AATTGGTAGA 


TGATTTATTG 


GCATTCCTAG 


CACCAATTAC 


C C AT C C AG AG 


CGACTTGGCA 


4560 


AACCAAATTC 


TCAAATTGAG 


TATACTGAAG 


ACGAAGTTCG 


TATTGCTCAA 


TTAGCTGATA 


4620 


AGTATACAAC 


GTCAGATGGT 


TACATTTTTG 


ATGAACATGA 


TATAATCAGT 


GATGAAGGAG 


4680 


ATGCATATGT 


AACG C CT CAT 


ATGGGCCATA 


GTCACTGGAT 


TGGAAAAGAT 


AGCCTTTCTG 


4740 


ATAAGGAAAA 


AGTTGCAGCT 


CAAGCCTATA 


CTAAAGAAAA 


AGGTATCCTA 


CCTCCATCTC 


4800 


CAGACGCAGA 


TGTTAAAGCA 


AATCCAACTG 


GAGATAGTGC 


AGCAGCTATT 


TACAATCGTG 


4860 


TGAAAGGGGA 


AAAACGAATT 


CCACTCGTTC 


GACTTCCATA 


TATGGTTGAG 


CATACAGTTG 


4920 


AGGTTAAAAA 


CGGTAATTTG 


ATTATTCCTC 


AT AAGG AT C A 


TT AC CAT AAT 


ATTAAATTTG 


4980 


CTTGGTTTGA 


TGATCACACA 


TACAAAGCTC 


CAAATGGCTA 


TACCTTGGAA 


GATTTGTTTG 


5040 


CGACGATTAA 


GTACTACGTA 


GAACACCCTG 


ACGAACGTCC 


ACATTCTAAT 


GATGGATGGG 


5100 


GCAATGCCAG 


TGAGCATGTG 


TTAGGCAAGA 


AAGACCACAG 


TGAAGATCCA 


AATAAGAACT 


5160 


TCAAAGCGGA 


TGAAGAGCCA 


GTAGAGGAAA 


CACCTGCTGA 


GCCAGAAGTC 


CCTCAAGTAG 


5220 


AG AC TG AAAA 


AGTAGAAGCC 


CAACTCAAAG 


AAGCAGAAGT 


TTTGCTTGCG 


AAAGTAACGG 


5280 


ATTCTAGTCT 


GAAAGCCAAT 


GCAACAGAAA 


CTCTAGCTGG 


TTT AC G AAAT 


AATTTGACTC 


5340 


TTCAAATTAT 


GGATAACAAT 


AGTATCATGG 


CAGAAGCAGA 


AAAATTACTT 


GCGTTGTTAA 


5400 


AAGGAAGTAA 


TCCTTCATCT 


GTAAGTAAGG 


AAAAAATAAA 


CTAATGAAAA 


ATGAAAGTCT 


5460 


CGATAAAGAG 


GCTTTCATTT 


TTATTATGTA 


TATATGTAAA 


ATTCTTGACA 


AGCAATATTA 


5520 


AAAAGAGTAA 


ACTATTAACT 


AGTTAATTAA 


CCGGTTTATT 


ACTTTATAGT 


GAATCAAATA 


5580 


TACTTAAGAA 


AAGAGGAAAG 


AAT GAAAATT 


AATAAAAAAT 


ATCTAGCAGG 


TT C AGTGGC A 


5640 


GTCCTTGCCC 


TAAGTGTTTG 


TTCCTATGAA 


CTTGGTCGTC 


ACCAAGCTGG 


TCAGGTTAAG 


5700 
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AAAGAGTCTA ATCGAGTTkC TTATATAGAT GGTGATCAGG CTGGTCAAAA GGCAGAAAAC 57 60 

TTGACACCAG ATGAAGTCAG TAAGAGGGAG GGGATCAACG CCGAACAAAT CGTCATCAAG 582 0 

ATTACGGATC AAGGTTATGT GACCTCTCAT GG AG AC C AT T ATCATTACTA TAATGGCAAG 5880 

GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA AAGATCCGAA TTATCAGTTG 5 94 0 

AAGG ATT C AG ACATTGTCAA TGAAATCAAG GGTGGTTATG TTATCAAGGT AGAT GGAAAA 600 0 

T ACT ATGT T T AC CTT AAGG A TGCAGCTCAT GCGGATAATA TTCGGACAAA AGAAGAGATT 60 60 

AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 612 0 

GTTGCAGCCA GAGCCCAAGG ACGCTATACA ACGG AT GAT G GTTATATCTT CAATGCATCT 6180 

GATATCATTG AGGACACGGG TGATGCTTAT ATCGTTCCTC ACGGCGACCA T T AC C ATT AC 624 0 

ATTCCTAAGA ATGAGTTATC AGCT AG CG AG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG 6300 

AAGCAGGGAT CTCGTCCTTC TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA 63 6 0 

TTGT C AG AG A ACCACAATCT GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC 642 0 

ATTTCAAGCC TTTTACGTGA ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT 64 80 

GATGGCCTTA TTTTCGACCC AG CG CAAAT C ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC 6 54 0 

CCTCATGGTA ACCATTACCA CTTTATCCCT TATGAACAAA TGTCTGAATT GG AAAAAC G A 6 60 0 

ATTGCTCGTA TTATTCCCCT TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA 6 660 

GAACAACCAA GTCCACAATC GACTCCGGAA CCTAGTCCAA GTCCGCAACC TG C AC CAAAT 672 0 

CCTCAACCAG CTCCAAGCAA TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA 67 80 

GTAGGCGATG GTTATGTCTT TGAGGAGAAT GGAGTTTCTC GTTATATCCC AG CC AAGG AT 684 0 

CTTTCAGCAG AAACAGCAGC AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT 6 900 

CATAAGCTAG GAGCTAAGAA AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG 69 60 

GCTTATGACT TACTAGCAAG AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT 7 02 0 

GATTTTGAGG CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TGATAAAGTC 708 0 

AAGTTAGTGG ATGATATTCT TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA 714 0 

AAACCAAATG CGCAAATTAC CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC 7200 

AAGTACACAA CAGAAGACGG TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG 72 60 

GATGCCTATG TAACTCCACA TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT 73 2 0 

GAAGCTGAGA GAGCGGCAGC CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 7 380 

AC AGAC CAT C AGGATTCAGG AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC 7440 
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GTGAAAGCAG CTAAGAAGGT G C C ACT T GAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7 500 

GAAGTCAAAA ACGGTAGTTT AATC AT AC C T CATTATGACC ATTACCATAA CATCAAATTT 7 5 60 

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7 62 0 

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGT AG AC C AAGACAGTAA ACCTGATGAA 7 74 0 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7 800 

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT G AGG C TG AAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AG AT GCGG AG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7 980 

ATTAGACAAA ATGCTATGGA G AC AT TG ACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 804 0 

AAAGATAATA AC ACT AT T T C AG C AG AAGT A GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 819 5 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 12 0 

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT T C AC CT AC AT 180 

AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATGTTT CAAGGTTTCA GTCAATGT IT 24 0 

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CATACCTAAA TTATCGTTTT 3 00 

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG CTTCCGCTAA 3 60 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 420 

CAAAGGCAAG AAGACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 4 80 

GTTGCTCCAA AACTTCTGCC GCAGATTGAG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 54 0 

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACGCC TTGAGAACAG AAGATATCTG 6 60 
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CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 72 0 

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 
TTACCTTGAC CAAGCTACCA T ATTT G AG AC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 84 0 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 
CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CAT GAT T T C A GTACAGTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 102 0 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 12 00 

CTGGTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 126 0 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG C AC CG AT AG C AGCTCCACGA GAAACTGTTA 13 2 0 

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 13 80 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 144 0 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTGTCTGCA ACTTCTTTAG 15 6 0 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 162 0 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 174 0 

TTGAATGTAG AC AT T T AC AG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 180 0 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860 

ATATACATCA AC TGC AAT AC TGCCATCTTC GGCTGCCTTT AC G AC G AC AC CTTTAGAATA 192 0 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2 004 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGGGTTGGG CTGTTCGCCC ATTAAAGCGG CACCACAGCT GGGTTCAGAA CGTCGTGAGA 60 

CAGTTCGGTC CCTATCCGTC GCGGGCGTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 12 0 

GAGGACCAGA GTGGACTTAC CGCTGGTGTA CCAGTTGTCT TGCCAAAGGC ATCGCTGGGT 18 0 

AGCTATGTAG GGAAGGGATA AACGCTGAAA GCATCTAAGT GTGAAACCCA CCTCAAGATG 240 

AGATTTCCCA TGAT T AT AT A TCAGTAAGAG CCCTGAGAGA TGATCAGGTA GATAGGTTAG 3 00 

AAGTGGAAGT GTGGCGACAC ATGTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 3 60 

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTG AATAGATATT CAATTTTGAG 420 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGGAGAT ACACCTGTAC CCATGCCGAA 4 80 

CACAGAAGTT AAGCCCTAGA ACGCCGGAAG TAGTTGGGGG TTGCCCCCTG TGAGATAGGG 540 

AAGTCGCTTA GCTCTAGGGA GTTTAGCTCA GCTGGGAGAG CATCTGCCTT ACAAGCAGAG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTTAGCGGGT GTAGTTTAGT GGTAAAACTA 66 0 

CAGCCTTCCA AGCTGTTGTC GCGAGTTCGA TTCTCGTCAC CCGCTTTGAA CTTTGTTCTT 72 0 

TGTACCAAGT TTTTGACTTG GGCGCGTAGC TCAGGTGGTT AGAGCGCACG CCTGATAAGC 7 80 

GTGAGGTCGG TGGTTCGAGT CCACTCGTGC CCATAGTGTT TAGTCCATTA CTAGGGGATT 840 

GGAATATTAT CTGTTCACTA AGAGGACACG GGCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAG TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

GTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 102 0 

CGTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGGT 1080 

AG CTC AGTTG GT AG AG CAAT GGATTGAAGC TCCATGTGTC GGCGGTTCGA TTCCGTCTCG 1140 

CGCCATTTAT AT AT TTTGG A AGGGT AG CG A AG AGGC T AAA CGCGGCGGAC TGTAAATCCG 12 00 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACGGGCAT AGTTTAAAGG 12 60 

TAGAACTAAG GTCTCCAAAA CCTTCAGTGT GGGTTCAATT CCTACTGCCC GTGTTAATAG 132 0 

AATTATGGCG GGTGTGGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 13 80 

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGTATAG CCAAGCGGTA AGGCAAGGGA 144 0 

CTTTGACTCC CTCATGCGTT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

GCGGAATTGG CAGACGCGCT GG ACT C AAAA TCCAGTGTCC GCAAGGACGT GCCGGTTCGA 15 60 

CCCCGGCCGC CGGTATAGTA TAGTGTTAGG AACGTTGTTA TTCTTCGTTC CTTTTTTATA 162 0 

TTATTTTTGG TATAATTATA GTTATTCAAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 168 0 

TATGTCTTGT TCTATCGATT TATTAAAACA TCGGTATTTG AAAAATATTA AAGAAAATCC 1740 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTTAGAAG GGGATGCTAC 1800 
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AG A 1 G J/TGAA 


GT T ATG AAGG 


ATCTATTTCA 


TTATTTAGTT 


TCTACTTTGG 


ATCTCACCGT 


1860 


AGGAAAGG 1 A 


GATGATTTTG 


GCAATCTGAT 


CCAGTTAGTA 


GATCCGATAA 


GTCAGGATGC 


1920 


TATTTTATTT 


GAAGTTTCCT 


ATACAACGAT 


TGAGTTTGCA 


TTTGGTAAGG 


CTGAAACGAT 


1980 


TCAAGAGGTC 


GAAAATCGTT 


TCAATAATTA 


TATGAATGTA 


AT T C AG AG AA 


AGTTAGCTGA 


2040 


ATCAAATCAT 


GCTATTGTTG 


GCTGTGGTAT 


CCATCCCAAC 


TGGGATAAAA 


ATGAGAATTG 


2100 


TCCAGTGGCT 


TATCCACGCT 


ATCAGATGTT 


G ATGG AT TAT 


TTGAATTTGA 


GTAGAAATAT 


2160 


TAT TAAAT C A 


GATTTACATC 


ATTTCCCTGA 


ATATGGTACT 


TTTATCTGTG 


GGAGCCAGGT 


2220 


T C AGCTGG AT 


ATTTCAAAAA 


CCAACTACTT 


ACGGGTGATT 


AATGCTTTTA 


CT C AAATTG A 


2280 


AGCGGCTAAG 


GCTTATTTAT 


TTGCAAACTC 


TGAATTTTCG 


GGTGCGGATT 


GGGATACGAA 


2340 


AATTTCAAGG 


GATATTTTCT 


GGGAAGAATC 


T ATG C ATGG T 


ATCTATCCAG 


AGAATGTTGG 


2400 


GGTCAATGCT 


AGACTCCTTA 


ATGATGAAAC 


TGATTTTTTT 


GACTATCTAA 


ATCATTCTGC 


2460 


GATTTTTACT 


GCGGAACGTG 


ATGGGCAGAC 


CTATTATTTT 


TATCCTATTC 


AGGCTGGGGA 


2520 


CTATTTGGCT 


ACGTCCGAAA 


TCCAAGCATT 


TGCTCTGAAT 


GGGGATGAGG 


TT ATT AT T T A 


2580 


CCCCCAAGAG 


AAGG AT T T TG 


AAACTCATCG 


TAGTTACCAG 


TACCAAGATT 


TAACGACTCG 


2640 


AGGAACAGTT 


GAGTTTCGTA 


GTGTGTGTAC 


ACAGCCACTT 


GATAGGACTT 


TTGCTTCTGC 


2700 


AGCTTTTCAC 


TTGGGATTAT 


TGGTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


T AG AAAC AG C 


2760 


ACCTTTCTTT 


AAAGTATTTG 


GTTATGATTA 


CAAGTCTTTA 


AGGAGACAAT 


TTTCTAAGAA 


2820 


AAATCT T AC A 


GATGAGGAAG 


AAACTACGAT 


TATTGAATTT 


TCCAAAGACT 


TACTCCTACT 


2880 


AGCTGAGGAG 


GG AC TAGTGG 


TGAGAAATAA 


GGAAGAAATG 


ACCTATTTAC 


AGCCTTTGAG 


2940 


AG AAGAAT TG 


AGCCTATAAT 


TTCTCTTATA 


AAGGGAGAAT 


TTTCTGAAAA 


AT C ATGAT AT 


3000 


AAI GGACGAG 


AC TAT AG AT A 


AAGGATAGAG 


AGTAATGACA 


TTAGTTTATC 


AATCAACGCG 


3060 


TGATGCCAAC 


AATACAGTAA 


CTGCCAGCCA 


AGCAATTTTG 


CAAGGTTTGG 


CGACGGACGG 


3120 


CGGTTTGTTT 


ACACCGGATA 


CTTATCCAAA 


GGTAGATTTG 


AACTTTGACA 


AATTGAAAGA 


3180 


TGCTTCTTAC 


CAGGAAGTTG 


CTAAGCTAGT 


TTTGTCAGCA 


TTTTTAGATG 


ACTTTACAGT 


3240 


TGAGGAGTTG 


GACTACTGTA 


TCAACAATGC 


CT AC GAT AG C 


AAATT TG AT A 


CTCCAGCTAT 


3300 


TGCACCATTA 


GTGAAATTAG 


ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGGCCT 


TGTCTATTTT 


GCCATACTTT 


ATGACGACTG 


CTGCTAAGAA 


3420 


ACATGGTTTG 


GAGAACAAGA 


TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATGGCG 


GGGTTTGCGA 


ATGTGCCTGG 


TACTGAGATT 


ATCGTCTTTT 


AT C C AAAGG A 


3540 
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TGGTGTCAGC 


AAGATTCAAG 


AGTTACAAAT 


G AC C ACT C AG 


ACTGGCGACA 


ATACTCATGT 


3600 


TATTGCTATT 


GATGGTAACT 


TTG AC G ATG C 


GCAAACAAAT 


GTGAAGCACA 


TGTTTAACGA 


3660 


CGTGGCTCTT 


CGTGAAAAAT 


TGACTACCAA 


CAAGTTGCAA 


TTTTCATCAG 


CTAACTCTAT 


3720 


GAACATTGGT 


CGTCTGGTGC 


CACAAATTGT 


TTATTATGTT 


TATGCTTACG 


CTCAATTGGT 


3780 


TAAGACTGGT 


GAAATTGTAG 


CTGGTGAAAA 


GGTTAACTTC 


ACAGTACCAA 


CAGGAAACTT 


3840 


TGGAAATATC 


TTGGCTGCCT 


TTTATGCCAA 


ACAAATTGGT 


TTGCCAGTTG 


GTAAATTAAT 


3900 


CTGTGCTTCA 


AATGACAACA 


ATGTTTTGAC 


AGACTTCTTT 


AAAACACGTG 


TCTATGACAA 


3960 


AAAACGTGAG 


TTTAAGGTAA 


CAACCAGCCC 


ATCTATGGAT 


ATCTTGGTAT 


CTTCAAACTT 


4020 


GGAGCGCTTG 


ATTTTCCATC 


TTTTGGGAAA 


TAATGCTGAA 


AAGACAACTG 


AACTTATGAA 


4080 


TGCCTTGAAC 


ACGCAAGGAC 


AATATAAGTT 


GACAGACTTT 


GATGCAGAGA 


TTTTGGACCT 


4140 


CTTTGCAGCT 


GAATATGCGA 


CTGAGGAAGA 


AACGGCAGCA 


GAGATCAAGC 


GTGTTTGTGA 


4200 


GTTAGATTCT 


TATATCGAGG 


ACCCTCATAC 


AGCTGTTGCT 


TCAGCAGTTT 


ATAAAAAATA 


4260 


CCAATCGGCC 


ACTGGAGATG 


TAACTAAGAC 


AGTGATTGCT 


TCAACAGCTA 


GTCCATACAA 


4320 


GTTCCCAGTA 


GTTGCAGTAG 


AAGCTGTAAC 


TGGAAAAGCA 


GGTTTAACAG 


ACTTTGAAGC 


4380 


CTTGGCTCAA 


T T AC ATG AAA 


TCTCAGGCGT 


TGCAGTGCCA 


CCAGCAGTTG 


ATGGGCTTGA 


4440 


AATAGCTCCA 


ATTCGTCACA 


AGACAACAGT 


GGCAGCTGCT 


GACATGCAAG 


CAGCGGTTGA 


4500 


GGCTTATTTA 


GGACTTTAAG 


ACAGAGGGAG 


CAAACTCGGT 


TGGGAAACCA 


ACTGAGTTTC 


4560 


TTTTCATCAG 


GAGGAGAGAT 


TGTTTAAGAA 


AAATAAAGAC 


ATTCTTAATA 


TTGCATTGCC 


4620 


AGCTATGGGT 


GAAAACTTTT 


TGCAGATGCT 


AATGGGAATG 


GTGGACAGTT 


ATTTGGTTGC 


4680 


TCATTTAGGA 


TTGATAGCTA 


TTTCAGGGGT 


TTCAGTAGCT 


GGTAATATTA 


TCACCATTTA 


4740 


TCAGGCGATT 


TTCATCGCTC 


TGGGAGCTGC 


TATTTCCAGT 


GT T ATT T C AA 


AAAGCATAGG 


4800 


GCAGAAAGAC 


CAGTCGAAGT 


TGGCCTATCA 


TGTGACTGAG 


GCGTTGAAGA 


TTACCTTACT 


4860 


ATTAAGTTTC 


CTTTTAGGAT 


TTTTGTCCAT 


CTTCGCTGGG 


AAAGAGATGA 


TAGGACTTTT 


4920 


GGGGACGGAG 


AGGGATGTAG 


CTGAGAGTGG 


TGGACTGTAT 


CTATCTTTGG 


TAGGCGGATC 


4980 


GATTGTTCTC 


TTAGGTTTAA 


TGACTAGTCT 


AGGAGCCTTG 


ATTCGTGCAA 


CGCATAATCC 


5040 


ACGTCTGCCT 


CTCTATGTTA 


GTTTTTTATC 


CAATGCCTTG 


AATATTCTTT 


TTTCAAGTCT 


5100 


AGCTATTTTT 


GTTCTGGATA 


TGGGGATAGC 


TGGTGTTGCT 


TGGGGGACAA 


TTGTGTCTCG 


5160 


TTTGGTTGGT 


CTTGTGATTT 


TGTGGTCACA 


ATTAAAACTG 


CCTTATGGGA 


AGCCAACTTT 


5220 


TGGTT T AG AT 


AAGGAACTGT 


TGACCTTGGC 


TTTACCAGCA 


GCTGGAGAGC 


GACTTATGAT 


5280 


GAGGGCTGGA 


GATGTAGTGA 


TCATTGCCTT 


GGTCGTTTCT 


TTTGGGACGG 


AGGCAGTTGC 


5340 
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TGGGAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCGC 5400 

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG GATGATTGGA AAAGAGTTGC 54 60 

TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 552 0 

TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAGCGGTGGA 55 8 0 

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTGGGACC CCTATGACGA CAGGAACAGT 5 640 

CATCTATACG GCAGTCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 57 00 

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 57 6 0 

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5 82 0 

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5880 

TATTTGGGAT TTAGACGGGA CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA G AT AGAAAT C TGGATGTTGA 606 0 

GGTGCTAAAT CAGGTGCGTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTGAT 612 0 

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACT CAT AAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGGTGG AATCCTATTT 624 0 

TACAGAGATT TTAACCAGTC AGAGTGGCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 63 0 0 

CTATCTGCTA GATAAGTATC AGTTGAATTC TGATAATACT TATTATATAG GGGATCGGAC 63 60 

TCTGGATGTG GAATTTGCCC AGAATAGTGG GATTCAAAGC ATCAACTTTT TAGAGTCTAC 6420 

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGC AG AT AT T TCCCGTATTT TTGAGACTAA 6480 

GTGATAAAAA GATTGTGTCA GTTTTGTGAC AGAGACCTAA CAAACTATTT CAAGTAACCT 6540 

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600 

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGGAA TTTGAAAGAA TGAAGAAAAG 6 6 60 

AATGTTATTA GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTCAAGCAGA 672 0 

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AAC G ATTTG A CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG C AG AAG C CT T 6840 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900 

CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGTTGAAAT 6960 

CCAAACACCT CAAGCAGACT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 7 02 0 

TAATCAAGTG ACCGTTGATG ATCAAACTGT TCAGGTTGCA GACCTTTCTC AACCAATTGC 7080 
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AGAAGTTACA AAGACAGTGA TTGCTTCTGA AGAAGTGGCA CCATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 72 00 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 72 60 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 732 0 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7 3 80 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA AT CTG AAAAT GCAGGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7 500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCGTGGCAT 7 62 0 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7 680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA G AAAAT C ACT ATGATCACGT 7740 

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7 8 00 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7 8 60 

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7 92 0 

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 804 0 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGGC 816 0 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 822 0 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340 

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 84 60 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 852 0 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT T T T AC ATT TT TTATTTGGAA 85 8 0 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8 64 0 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGT ATGT CAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 87 60 

TTATTTGTGA TTGGTGCTGC G AC TT ACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8 82 0 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8 880 
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AGGTCGATTC TAATCTCGCT AATCCTTTTT AATCCAGAAT AAGGGAAATA TGTTATACTT 8940 

GTTTTTAAGA AAAAAGTCTC ATTGAATTGG TTTTGAGGAG TTAGAAATGA AAGT AT T AG T 9000 

GACAGGTTTT GAGCCCTTTG GAGGGGAAAA GGGCAATCCA GCTTTGGAGG CCATTAAAGG 90 60 

TTTACCAGCT GAAATCCATG GTGCTGAGGT CCGTTGGCTA GAGGTGCCGA CAGTTTTTCA 912 0 

CAAATCTGCT CAAGTATTGG AAGAAGAGAT GAATCGTTAT CAACCTGACT TTGTCCTTTG 9180 

TATTGGGCAA GCTGGTGGAA GAACTAGTTT GACACCTGAA CGAGTGACCA TTAATCAAGA 92 40 

CGATGCATGC ATTTCTGATA ACGAAGATAA TCAACCGATT GACCGTCCCA TTCGCCCAGA 9300 

TGGTGCTTCG GCCTACTTTA GTAGTTTGCC GATTAAAGCG ATGGTTCAAG CTATAAAAAA 93 60 

AGAGGGCTTA CCGGCCTCTG TTTCCAATAC GGCAGGGACT TTTGTCTGCA GCCATTTGAT 942 0 

GTATCAGGCT CTCTATTTGG TAGAAAAGAA ATCTCCATAT GTTAAGGCAG GTTTTATGCA 9480 

TATTCCTTAT ATGATGGAAC AGGTGGTGAA CAGACCGACT ACTCCAGCTA TGAGTTTAGT 9 54 0 

GGATATTCGG CGAGGGATAG AAGCAGCAAT CGGCGCTATA ATAGAACATG GAGATCAGGA 9 6 00 

ACTCAAGTTG GTAGGCGGAG AAACTCATTG ATAGAAAAAA GCTTGAGGGG AAAAACCTTC 9 6 60 

AAGCTTTTGG ACGTTTTCGG GCCAATACTG CTCGGTAAAA CATAATTTTA GTGCATTGGA 972 0 

TATAAGGTAG GAGTGAAAAA CTAGCAATGC CAAAGGTAAT CCAATTGAGG AAGTACCAAG 97 80 

GAAGAAGCTG TAAATCTAGG ACAAAGTGCT GGAACTTGTA GCCCTTCATA AAGGAACGGC 9840 

TAGTTTTTAG GATTCGTCTT GGTGGGACCT GTCCTAGGTC TAGACTATAA CAGAGAAGAA 9 900 

ATTCCACCTG TGAATAGGCA TAATACTGTG GAATATAGAG GATATTTCCT ACAATGATCA 9 9 60 

AGATGAGACT TGCAAGAAAG TAGAGTCCAA AGACCATGAG GAAACGCTCG GTTTCAACTG 10020 

ATGAGAGATC TAGATTTGGA AACTCAGGAT GTAGGGTGAC GAATTTTTTG GCTAAAAAGC 10080 

TACTATAAAA GAGGAGGTAA ATCCCAAGTA AATTAGGGAT AC T C CAT AAA AAGAGATAGA 1014 0 

AACGTTTGAG AAGTAGGGTC AAAAAGGTTT G AG AAAAGC G CTCCTCATCA AAGAGAGCTA 10200 

GGCTGTTTTT T AC AG AT G GC TCCGTTTTAG AATCTTTCAT GAGTGTCAGT GTTGCATAGA 10260 

CGGAACTGGT CAAAAGAATA GTCCCGATAA AGGAGACTAG TAGAGGAAAG AGGTAGGTTT 10320 

G AAGT ATT TG GCCAAGTATG CTGAAAAATG GCTGTTCTAA AACAGTCCCG TGGATCCGAG 103 80 

ATAAGGGATT AAGAAAACCA GATAAGATGA CC AG CAT ACT GGGAAGGATA TAGAGGAGAA 10440 

AGAGACGGGG GGTGTCAGCC TGAAAATGTT TTGACTCCTG ACGAATTGTT TTTAAATCAA 10500 

TTTTTGGATA GTTCATTCTC TTATTATACC ATAGTTCTTA TACATAGTTC GTGACAGTTC 10560 

CTACTTTTTT TGATAAAATC ATACAGTGTG TCCTTGGGCA CACTGTATGA ACTGGGACTG 10 62 0 
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TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 106 80 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 108 60 

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10 980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TT C AG AC AT C ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 1122 0 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 112 80 

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 1134 0 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 114 60 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520 

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAAGTGCG 11700 

AC AAGC C AT C ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 117 60 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60 

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GC AT C TC AAA AATACGGACA 12 0 
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GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 180 

TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 240 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 3 00 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT T AG AAAAT C A 3 60 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 42 0 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 480 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT C T T C AT AT AA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 72 0 

TCTCCTCCAC AC T GGC AG AT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 84 0 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 9 00 

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 9 60 

AAAACTTGCC AAATTGTCAG AAT TAT GAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 102 0 

TGCTATGAGA G ACG AT AT C A AAATCAATGA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT 10 8 0 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 114 0 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 12 00 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 12 6 0 

CGAGGGTATC AAAGAT AT C A AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 132 0 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 13 8 0 

TT T AAAGAT G AAAATCAAAG GG C AAACT AG AAAACTAGCC GCAGGTTGCT CAAAACACTG 1440 

TTTTGAAGTT AT GG AT AG AA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 15 60 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 162 0 

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TAT T C C AG AA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 174 0 

GC AT AT TT AC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA I8 60 
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AC G AT AT CGC CGGTACGGAG ATAGGCAATT G GT C C AC CAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA C C AG AC CAT A AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 198 0 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG AT AG C AT CT C AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 216 0 

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 222 0 

AAGATGATAA G C GG AC CATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 22 80 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 234 0 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2400 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 24 60 

GGCACACGTT CTTGAATCGT AT TGAAGT C C T C AAGTG AC A AGTCAACATT TGCGGCATGG 252 0 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 2 580 

AT AG C AT CTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2 640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 27 00 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 27 60 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2 82 0 

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA 2 880 

ACCAAGTCGA TAT CTT T AC C ATCAAGATTT CCCGGTGCAA TAGTTCCACC ATAGGCGAAA 2 94 0 

ATAGCTGGGA TATCCATATT AGCAATAGCA AT C AT AGAT C CAGGCATGTT CTTGTCACAG 3 000 

CCACCGATAG CGACGAAGGC ATCCACGTTG TG AC C ACT C A TAGCCGCCTC GATGGAGTCC 3 060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 312 0 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

G AAAT C ACT C CCACAATCGA TGTTTCAAAG TCCTTATCTG T CAT AC C AGT CGCACGAAGC 3 300 

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 33 60 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 354 0 

GCTTTCTTTA AATAACGTAC TGTAATTTTT AC AG AAATT C TTTCAAATAA GTGTATTTAA 3 600 

CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3 660 



WO 98/18931 



PCT/US97/19588 



743 

ATACTCATTA AGAAGATAAT CCATTTTCCC T ACT T G AC C G AATCTTTCTT GAACACCCAT 372 0 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3 7 80 

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 384 0 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA AT CT AT C AC A CCTACTGAAT AACCTAATTT 3 900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3 960 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 402 0 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 414 0 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GT T C C AC CAT TCATCTCTGC 42 00 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 42 60 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGG AC C A AATGTATGAA GATAAGGTCT 4 32 0 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 43 80 

ATT C AC AT AA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4 500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4 560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4 62 0 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTT CATA AATTTCTCTA 47 4 0 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4 800 

TCACCTGGAT T AC AAAT AT C CTTTGTAAAA CCATCTAATT GTTTTTTGTT AT C AT C AAC A 4 8 60 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GT AT AG C C T C C C AAC AT T G T 492 0 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4 9 80 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 504 0 

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT AT AAGAATT C TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TGCAAATATT 522 0 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 52 80 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 534 0 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC T T AAAAT CAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 54 60 

AG AC C AAT C A CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5 52 0 

GATAGACCAA AATCT G CAT C TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 55 80 

GGCATTAAAA AG AC TGG AAG AAAACTG AT T AAAATACCTT GTAAAAATGC TCCAATAGTG 5 64 0 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 57 00 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG C AAT CAT AAT T AC CAT AC T T 57 60 

ACTAAACCAC CAACAAAACT AGAGATAAAT C C AAT T AG AA CTGCATTGGG TGCATAAGTA 582 0 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5 880 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGT AC T AT C A CGTAAAAAAG CTAAACTCTT TGGAAATTTA 612 0 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG AT AAGG C T AC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC AT AAT AAT AC CGAGTGCTAA TCCTCCTAGT 63 00 

AAG AT GAG AG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 63 60 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA AT C G AGT AAA ACGAGCGATT 6420 

AAG AT ATT G A ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 64 8 0 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 6 540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA AT C CT AAG AT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6 72 0 

TCCAATATGA ACTTCATCAT GACGCTAGCC T C C T AT AT AA GTCCTTTTTC TTCACAAAGT 67 8 0 

TTAGTAATTA ATTCTCGTAG TTCATCCATA T C AAT AAT AC TATTTAAGAT ACGAACATCT 6840 

CC AAG AT G AC TAGCTGAATC AGCT AG AT C A CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC AT AATGTT C A ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 702 0 

GT AC C AAT T T TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200 
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TCTTGTTCTG GATTATCCAA TAAATAAATC GGTTCTTCCA AAACTAACAT TGACATTCCT 72 60 

ATTTCATTCA CACCTTCATC TGGCCGAGCG TGAGGAATTG CTACTCCCTT CCCTAAATTA 7320 

AT AAAAGGT C CAAACTCTTC TACTTTTTGA ATCATTGCCT CAGGGTAGTT CTCAGTTATC 73 80 

TTATCTTGAT CCAAAAGCGG TTTAGCTGCT AAACGAATCG CCTCCTTCCA TCCTAATTTT 7 44 0 

TGCGAACTAA CCTGATAGGT TTCTTTGGTA ATAAGTTGTT CTAGCACTGG TACAATTTCC 7500 

TT T C TAT CAT TTTTTTGGTA AAGATAATTC TTTAACGCCA ATCTTAATTC CAATTCTTGT 7 5 60 

GTAATAATTC CATATCTTTT GACAATATTC AGGATTTGTT CAATCTCAAA ATCTC C AT AC 7 62 0 

TCTAAATTCG GAAAATCTTT TAACACTAGT TCTACTAGTT GTATTGCTTG CTCTTCAGTC 7 680 

ATCATAACCG AAACTAGATA ATTTGGCTTT TCTGTCTCCA CCTTTATGGT AGAAAAAACC 774 0 

AT AT C AT AGT CACTACTAGC TTTCACCTGT AAATCATCAA TCTTTGAGGT TCCTATAAAC 7 8 00 

TCAATTTGAG GAAATAATGC TAATAGATTC TCTTTTAACA TCAATGAAGA ACTAACACCA 7860 

TTAGGACAAA TGATTGCTGC TTT AT AC CAT TTTTGAGGCA AAGTATCTGC TTTCTTTAAA 7 92 0 

TAACCTCCGA AATGGATAAC AAAATATGCT GTTTCACTAT CAGGTATGGG ATTGTCAATA 7 980 

GCGTCCATCA AGGGCATCAA AGAATCTTTG ACTAATTCAA ATAAATCAGG ATAATGTTCT 8040 

TTAACATGCA AT AC AT AT T C ATTTGAACTA GGTAGGCCGA ACTTTAATCT ATAGTAAGCC 8100 

GGTATAAGGT GGCGGCGAAG ATTTTCTCTC AATCCTTCCC TTTGTTTAAA ATGTAACAAA 8160 

GAAATATCTT C C ATTC T ACT TATAATAGCC TCTGTTAATT GATTAAAGTA AACCGGAGCA 8220 

ACATCTACTT CACCTTCAAA GCAACTTGAT AATAAAACGG TGATATAGCG AT AAT CAT C C 82 8 0 

TCAGAAAACA CCGTATCTAT AATT CCC AAA TCAACCACTG TAT C C AAT AA AATAGTGGTT 83 4 0 

ATATCTTGAA TAACAGGAGA TACTAATGTC TCTGAAAGAC ATACTCTTTC AACATCCCTT 8400 

TGATACCTAC ACAGAATGAA TACTAAACCG AAAAGGT AAA CTTTTAATTG ATTAACAATA 84 6 0 

GGT ACT AG CT GTAGCTTCTC AT AAT AAT CT TTAACTACCT GATCAATCAA ATCATAAGTT 852 0 

AATGAAT AC C CCCAACTGGA TAAAACATAA TCCAAACCCC AAATCCCTAT GGAGGATTCC 8580 

AGCAACTCAC TAACCATTTG AAAAGCTAAG CGGTGCTTAT TCCACTCTGA ACCGTGTAAA 8640 

GTATAACCTT TTGCTCTACT GTACCCTAGC T C C AAAT CAT TATCTAACAT AATCTTTCTT 8700 

AATGATT G AA TATCAGATAA GGTTGTATTC TTACTTACTT TCAAAAAGTC TTGGTAATGA 87 60 

CTATTCGATA TAAAATCTAA TCGGCAAAAA GTGTAAAGAT AG ATT AAAG C TAAGCGAGTC 8820 

GACTTTGGTA AAACCAATTC ATCCGACTTA AT AAT AT CTG TCAAAGACTG CTTCGTACGA 8880 

TTTGATAAAC TATAGCGACC TTGCTTTTTA TCCAGCACTA TCCCTTTATT AGCTAGATAA 8 940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 

AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9 060 

TAATCAAAC 9069 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 98: 
CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 
TTCTTGCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 12 0 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 
GAACAGCTGA AC AAAAAG G T GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 2 40 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 3 00 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCGTAG GACCTGCGAC 3 60 

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CAT T G ATCT C 42 0 

ACCAAGTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 
CCAAGATAGC TTCCAAGGAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 54 0 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATGCT AAAGGGATTG CAAAATCTTT 600 
CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 
CTTCCAAGCA GCCCTTACAA AAATGAAAGG G AAAG ACT T T GATGCTATCG TTGTTCCTGG 720 
TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCGT GGCATGGGAA TTGACAAACC 780 
AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 
AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 
TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 9 60 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA G AC C AC AAC A CAGTCAAAAC TGCTTACATG AT G AC C ATG A ACAATGGTAA 114 0 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT GTTGAAATAG GGAATGAGCC 12 00 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 12 60 
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ACGTCGCCTT GGATTATATA TGTGACTGAC TTCGTCAGTC TTATCTACAA CCTCAAAGCA 132 0 

GTGCTTTGAG CAACCTGCGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATAA 13 8 0 

G AACCT AT C A AAAAGTGAGG GAAAACCCTC GGAATTATAA ATAGAAAGAG TGAATCTTAT 144 0 

GCTCCAACAA CTCGTAAATG GTTTGATTCT AGGTAGTGTT TACGCGCTGT TAGCCCTAGG 1500 

ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC GCCCATGGTG ATATTTATAT 15 6 0 

GATGGGAGCC TTTATCGGTT ATTTCTTGAT CAATTCTTTC CAAATGAATT TCTTTGTAGC 1620 

GCTTATTGTA GCTATGCTAG CGACAGCTAT TCTTGGTGTC GTGATTGAGT TTCTTGCTTA 16 80 

CCGACCTTTG CGCCACTCTA CTCGTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 174 0 

CCTATTGGAG TATGGAATGG TCTATCTGGT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 18 00 

GATTCAAACA GTTCGATATG ATTTGGGACC AATTAGCTTA ACAAATGTGC AGTTAATGAT 18 60 

TTTGGCCATT TCCTTGATTT TGATGATTTT GTTACAAGTC ATTGTCCAAA AGACTAAGAT 192 0 

GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACGCG GCGCAATTGA TGGGGATCAA 19 80 

TGTAAACCGT ACGATTAGCT TTACCTTCGC TTTGGGTTCT GCTCTTGCGG GTGCGGCTGG 2 04 0 

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGGGGTTA CTCCAGGTCT 2100 

TAAATCTTTC GTTGCCGCAG TACTTGGTGG TATCGGAATT ATTCCTGGTG CGGCTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT C AG AT TT C C G 2 220 

TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTGATT GTCCGCCCAG CTGGTATCCT 22 8 0 

TGGTAAGAAT GT GAAAGAG A AGGTGTAAAC GATGAAGGAA AATTTAAAAG TTAATATTCT 2 340 

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTGGTTT CAGTCGGAGT 2 4 00 

ACTTAATCTA TTCTATGTAC AGATTTTACA ACAAATTGGA ATTAATATTA TTTTGGCTGT 2 4 60 

TGGTCTCAAC TTAATCGTTG GTTTTTCAGG ACAATTTTCA CTTGGTCATG CTGGTTTCAT 2 520 

GGCGATTGGT GCCTATGCAG CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2 580 

CTTTGGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA GTTGCCTTAC TTGTCGGCAT 2 64 0 

TCCAACCTTG CGCTTGAAGG GGGACTATCT TGCGGTAGCA ACTCTGGGTG TTTCTGAAAT 27 00 

TATCCGTATC TTTATCATCA ATGGTGGAAG CCTTACAAAT GGTGCGGCAG GTATCTTAGG 27 6 0 

GATTCCTAAC TT T AC AACTT GGCAAATGGT TTACTTCTTT GTCGTGATTA CAACCATTGC 2 82 0 

AACCTTGAAC TTCTTGCGTA GCCCAATTGG TCGTTCAACC CTCTCTGTTC GTGAAGATGA 2 880 

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAAT CAT C G CTTTTGTCTT 294 0 

TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 3 000 
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ACCGAAAGAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGGTGG 30 6 0 

ACTCGGTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 312 0 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT T AT TT AC G C T TTGGCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG AGCCTATCAC GTTTCTTTAA 32 40 

AAAATCTAAG AAGGAGGAAC AAAACTAATG GCATTACTTG AAGTAAAACA GTT AAC C AAA 33 00 

CATTTTGGTG GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3 3 60 

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCGGT 3 42 0 

GTTTATGAAC CAAGCGAGGG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3 4 80 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3 540 

G AT TT AAC AG T T TT AG AT AA TGTTTTGATT GCTTTTGGAA AC CATC AC AA AC AG C ATGTT 3 600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3 6 60 

GCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG C AG AG AC T C T TGCTAAAAAT 3 720 

CTTTCCTACG GACAACAACG TCGTTTGGAA ATTGTTCGTG CCCTTGCTAC GGAACCTAAA 3 7 80 

ATTCTCTTCT TAGATGAACC AGCAGCAGGT ATGAACCCAC AGGAAACAGC CGAATTGACT 3 84 0 

GAGTTAATTC GTCGTATCAA AGATGAGTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3 9 00 

ATGAATCTGG T CAT GGAAGT AAC AG AAC G T ATCTACGTAC TTGAATATGG CCGTTTAATC 3 9 60 

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 402 0 

GGTGAAGCCT AATGTCTATG TTAAAAGTTG AAAATCTTTC TGTGCATTAC GGTATGATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTG 414 0 

CCAACGGTGC AGGTAAGACA ACT ATT C T T C GCACCTTGTC AGGTTTGGTT CGACCAAGTT 42 00 

CAGGAAAGAT TGAATTTTTA GGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTGG 42 6 0 

CAAGTGGTCT TTCACAAGTT C C AG AAGG AC GCCACGTCTT TCCTGGCTTG ACTGTTATGG 43 2 0 

AAAATCTTGA AATGGGAGCT TTCTTAAAGA AAAATCGTGA AG AAAAT C AA GCTAACTTGA 4 3 80 

AGAAGGTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGCCACTC 444 0 

TTTCAGGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA ACACCAAAAC 4500 

TTCTTCTTTT AGATGAACCA TCAATGGGAC TTGC CCCAAT CTTTATCCAA GAAATTTTTG 456 0 

AT AT C ATT C A AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 462 0 

ATAAAGCACT TGCAATCTCT GACCGAGGAT ATGTACTGGA AACAGGGAGA ATCGTCCTAT 4 680 

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAG CAT AT CTAGGTGGCT 474 0 

AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGAGAT TACGAAGTAA T CAT C AAT AT 4 800 



